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IN THE UNITED STATES PATENT AND TRADEMARK 
REQUEST FOR FILING NATIONAL PHASE OF 
PCT APPLICATION UNpER~gTDjjjkC. 371 AND 37 CFR 1.494 OR 1.495 

Asst. Commissioner of Patents / ^\ (Our Deposit Account No. 03-3975 

and Trademarks 
Washington, D.C. 20231 



To: 



TRANSMITTAL LETTER TO THE UNITED 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 




Atty Dkt: PM 273884 



/P00428US 



M# /Client Ref. 

From: Pillsbury Madison & Sutro LLP, IP Group: Date: September 18, 2000 (MONDAY) 

This is a REQUEST for FILING a PCT/USA National Phase Application based on: 



4. 



International Application 

PCT/GB99/00816 

country code 



2. International Filing Date 



17 
Day 



MAR 1999 
MONTH Year 



3. Earliest Priority Date Claimed 
17 MAR 1998 



Day MONTH Year 
{use item 2 if no earlier priority) 
Measured from the earliest priority date in item 3, this PCT/USA National Phase Application Request is being 
filed within: 

(a) □ 20 months from above item 3 date (b) M 30 months from above item 3 date, 

(c) Therefore, the due date (unextendabie) is September 17, 2000 (SUNDAY) 

Title of invention NUCLEIC ACID BINDING PROTEINS 



6. Inventor(s) CHOP, Yen 

Applicant herewith submits the foiiowing under 35 U.S.C. 371 to effect filing: 



7. [X] Please immediately start national examination procedures (35 U.S.C. 371 (f)). 

8. [X] A copy of the International Application as filed (35 U.S.C. 371(c)(2)) is transmitted herewith (file if in 
English but, if in foreign language, file only if not transmitted to PTO by the International Bureau) including: 

a. !El Request; 
' b. Abstract; 

47 pgs. Spec, and Claims; 
d. 7 sheet(s) Drawing which are □ informal Kl formal of size A4 □ 1 1" 

9. M A copy of the International Application has been transmitted by the International Bureau. 

10. A translation of the International Application into English (35 U.S.C. 371(c)(2)) 
a. □ is transmitted herewith including: (1) □ Request; (2) □ Abstract; 

(3) pgs. Spec, and Claims; 

1 (4) sheet(s) Drawing which are: 

□ informal □ formal of size □ A4 □ 11" 
J). □ is not required, as the application was filed in English. 

c. □ is not herewith, but will be filed when required by the forthcoming PTO Missing Requirements 

Notice per Rule 494(c) if box 4(a) is X'd or Rule 495(c) if box 4(b) is X'd. 

d. □ Translation verification attached (not required now). 
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* RE4USA National Filing of PCT/GB99/00816 ^ ^ QQ ^ pQT/pj3 1 8 SEP2000 

11.^. [gf PLEASE AMEND the specification before its first line by inserting as a separate paragraph: 

a. S -This application is the national phase of international application PCT/g399/QQ816 

filed March 17. 1 999 which designated the U.S -- 

b. □ -This application also claims the benefit of U.S. Provisional Application No. 

60/ t filed .-- 

12 □ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 

371 (c)(3)), i.e., before 18th month from first priority date above in item 3, are transmitted 
herewith (file only if in English ) including: 

13. PCT Article 19 claim amendments (if any) have been transmitted by the International Bureau 

14 □ Translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)), i.e., of 

claim amendments made before 18th month, is attached ( required by 20th month from the date in 
item 3 if box 4(al above is X'd. or 30th month if box 4(b) is XU or else amendments will be 
considered canceled ). 

15. A declaration of the inventor (35 U.S.C. 371(c)(4)) 

a. □ is submitted herewith □ Original □ Facsimile/Copy 

b M is not herewith, but will be filed when required by the forthcoming PTO Missing Requirements Notice 
per Rule 494(c) if box 4(a) is X'd or Rule 495(c) if box 4(b) is X'd. 

_ 16 An International Search Report (ISR): 

O a. Was prepared by European Patent Office □ Japanese Patent Office U Other 

b EES has been transmitted by the international Bureau to PTO. 
Cm c. [X] copy herewith (3 pg(s).) [x] plus Annex of family members (I pg(s).). 

A 17. International Preliminary Examination Report (IPER): 

f g ! a . El has been transmitted (if this letter is filed after 28 months from date in item 3) in English by the 
m International Bureau with Annexes (if any) in original language. 

J". I " b. Kl copy herewith in English. 

- c.1 iper Annex(es) in original language ("Annexes" are amendments made to ciaims/spec/drawings 
j\ during Examination ) including attached amended: 

f* c.2 □ Specification/claim pages # claims # 

!f Dwg Sheets* lh . * ■ . 

ry d n Translation of Annex(es) to IPER ( required by 3 0 th month due date, or else annexed 

amendments will be considered canceled) . 

□ 18. information Disclosure Statement including: 

a. Attached Form PTO-1449 listing documents 

b. [X] Attached copies of documents listed on Form PTO-1449 

c. A concise explanation of relevance of ISR references is given in the ISR. 

19 □ Assignment document and Cover Sheet for recording are attached. Please mail the recorded 

assignment document back to the person whose signature, name and address appear at the end of 
this letter. 

20. □ Copy of Power to IA agent. 

21 . □ Drawings (complete only if 8d or 1 0a(4) not completed): _ sheet(s) per set: □ 1 set informal; □ 

Formal of size □ A4 □ 1 1 " 

22. □ (No.) Verified Statement(s) establishing "small entity" status under Rules 9 & 27 

23 Priority is hereby claimed under 35 U.S.C. 1 19/365 based on the priority claim and the certified copy, both 
filed in the International Application during the international stage based on the filing 
in (country) Great Britain of: 

Application No. Filing Date Application No. MA^Sf^ 

(1) 9805576.7 17 MAR 1998 (2) 9806895.0 31 MAR 1998 

(3) 9807246.5 03 APR 1998 (4) _ 

(6) 



(5) 



inS! See Form PCT/IB/304 sent to US/DO with copy of priority documents. If copy has not been 

received, please proceed promptly to obtain same from the IB. 
b. □ Copy of Form PCT/IB/304 attached. 
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* RE: USA National Filing of PCT/GB99/00816 
24. Attached: 



09/646353 

Page 3 of 3 

534 Rec'd PCT/PTG 1 8 SEP 2000 



25. Preliminary Amendment: 

25.5 Per Item 17.c2, cancel original pages # , claims # , Drawing Sheets # 

26. Calculation of the U.S. National Fee (35 U.S.C. 371 (c)(1)) and other fees is as follows: 

Based on amended claim(s) per above item(s) □ 12, □ 14, □ 17, □ 25, □ 25.5 (hilite) 



Total Effective Claims minus 20 = 

Independent Claims minus 3 = 

If any proper (ignore improper) Multiple Dependent claim is present, 



x $18/$9 = $0 
x $78/$39 = $0 
add$260/$130 +0 



966/967 
964/965 
968/969 



BASIC NATIONAL FEE (37 CFR 1.492(a)(1)-(4)): BASIC FEE REQUIRED, NOW 

A. If country code letters in item 1 are not "US"."BR". tt BB".'Tr."MX w ."IL" "NZ W , "IN" or "ZA" 

See item 16 re: * 

1 . Search Report was not prepared bv EPQ or JPO add$970/$485 

2. Search Report was prepared by EPO or JPO - - - add$840/$420 +840 



+0 



CP SKIP B, C, D AND E UNLESS country code letters in item 1 are tt US"."BR"."BB","TT","MX","IL", "NZ", "IN" or "ZA" 

Cfl □ B. If USPTO did not issue both International Search Report 
iQj W (ISR) and (if box 4(b) above is X'd) the International add$970/$485 
yi Examination Report (IPER), - - 

(Of) 



□ C. If USPTO issued ISR but not IPER (or box 4(a) above is 



X'd), 



add$690/$345 +0 



U {these) 

— ( 4) □ D. If USPTO issued IPER but IPER Sec. V boxes not all 3 

YES, - - add$670/$335 

□ E. If international preliminary examination fee was paid to 
USPTO and Rules 492(a)(4) and 496(b) satisfied (IPER 
Sec. V all 3 boxes YES for all claims), - - add $96/$48 



+0 



960/961 
970/971 



960/961 



958/959 



956/957 



962/963 



+0 



27. 

28. 
29. 



SUBTOTAL = $840 



If Assignment box 19 above is X'd, add Assignment Recording fee of — $40 
Attached is a check to cover the - TOTAL FEES 



+0 



(581) 



$840 



Our Order No. 



71278 


273884 


c# 





CHARGE STATEMENT : The Commissioner is hereby authorized to charge any fee specifically authorized hereafter, or any missing or insufficient fee(s) filed, or asserted to be 
filed, or which should have been filed herewith or concerning any paper filed hereafter, and which may be required under Rules 16-18 and 492 ( missing or in sufficient fee only) now 
or hereafter relative to this application and the resulting Official document under Rule 20, or credit any overpayment, to our Account/Order Nos. shown above for which purpose a 
duplicate copy of this sheet is attached. 

This CHARGE STATEMENT does not authorize charge of the issue fee until/unless an issue fee transmittal form is filed 



Pillsbury Madison & Sutro LLP 
Intellectual Property Group 



1100 New York Avenue, NW 
Ninth Floor 

Washington, DC 20005-3918 
Tel: (202) 861-3000 
Atty/Sec: PNK/sdm 



i*V Atty 



Paul N. Kokulis 



Reg. No. 16773 



Fax: 
Tel: 



(202) 822-0944 
(202) 861-3503 



NOTE: File in duplicate with 2 postcard receipts (PAT-103) & attachments. 
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Inventor(s): CHOP, Yen : ( Att V- Dkt 

Appin. No,: 09/648,353 or Patent No.: 273884/PQ0428 

Filed: September 18, 2QQQ or issued.; ; M# / Client Ref. 

Title: NUCLEIC ACID BINDING P ROTEINS 

SMALL ENTITY STATEMENT CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(d) and 1.27 (c)) - SMALL BUSINESS CONCERN 

I hereby state that I am 

□ the owner of the smaJI business concern identified below: 

£3 an official of the small business concern empowered to act on behalf of the concern identified below: 
NAME OF CONCERNi a^nda^J-imited 

ADDRESS OF CONCERN 1-3 Burtonhofe Lane, Mill HUH. London NW7 IAD. United Kingdom 

1 hereby state that the above identified small business concern qualifies as a small business concern as defined in 13 
CFR 121.12, and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees under Section 41(a) and (b) of 
Title 35, United States Code, in that the number of empioyees of the concern, including those of its affiliates, does not 
exceed 500 persons . For purposes of this statement, CO the number of employees of the business concenrns the 
average over the previous fiscal year of the concern of the persons employed on a full-time, part-time or temporary 
basis during each of the pay periods of the fiscal year, and (2) concerns are affiliates of each other when either, 
directly or indirectly, one concern controls, or has the power to control the other, or a third party or parties controls or 
has the power to control both. 

I hereby stat e that rights under contract or law have been conveyed to and remain with the smafil business concern 
identified above with regard to the invention entitled: NUCLEIC ACID BINDING PROTEINS 
by inventor^ ) Yen CHOP described in 

x -> □ the specification filed herewith, 
one El Application No. 009/646,353 , filed September 18, 2Q0Q 
box -> □ Patent No, , ri issued 

If the rights held by the above Identified small business concern are not exclusive, each smag entity individual, concern or organization having nghfct to the invention is listed In f A) 
and f 51 1 freiow and no rights to the invention ere held by any person, other than tha inventor, who could not qualify under 37 CFR 1.9(c) as an independent Inventor if that person 
had made the invention, or by any concern which would not quality m a small business concern under 37 CFR 1 .9(d) or a nonprofit organization under 37 CFR 1 .9(e). 

(A) FULL NAME of assignee/licensee/grantee/conveyee* 
ADDRESS 

□ INDIVIDUAL □ SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

(B) FULL NAME of assignee/licensee/grantee/conveyee* 
ADDRESS 

□ INDIVIDUAL □ SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

*NOTH: Separate statements required item each person, concern or organization named In (A) and (8) above having rights to the Invention, averring to ras/herfte 
status as a small entity, (37 CFR 1-27) 

1 aeknowferiqe the duty to file, in this case, notification of any change in 3tatus resulting nfossjrterrgaetnent.to smalt entity status prior to paying, of at tha time of paying, the 
earliest of the issue fee or any maintenance fee due after the date on which status as a smaE entity is no longer appropriate. (37 CFR 128{b}) 



NAME OF PERSON SIGNING 
TITLE OF PERSON OTHER THAN OWNER 
ADDRESS OF PERSON SIGNING 



SIGNATURE 



^\f^(^^^r^_^^^ DATE ^° r ^c'A^ 61} 
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Filed: Se ptember 13, 2000 or Issued: 



For NUCLEIC ACID BINDING PROTEINS 



SMALL ENTITY STATEMENT CLAIMING SMALL ENTITY 
STATUS (37 CFR 1.9(f) and 127 (htt - INDEPENDENT INVENTOR 

As a below named inventor, I hereby declare that I qualify as an independent inventor as defined in 37 CFR 1.9(c) for 
purposes of paying reduced fees under Section 41(a) and (b) of Title 35, United States Code, to the Patent and 
Trademark Office with regard to the invention entitled as above and described in 



X □ 
one -> 



the specification filed herewith 
Application No. 09/646.353 . filed September 18, 2000 



box □ Patent No. , issued 

I have not assigned, granted, conveyed or licensed and am under no obligation under contract or law to assign, wnvey 
oTiicense any rights in the invention to any person who could not be classified as an independent inventor under 37 CFR 
19(c) if that person had made the invention, or to any concern which would not qualify as a small business concern 
under 37 CFR 1 .9(d) or a nonprofit organization under 37 CFR 1 .9(e).: 

Each ( small entity) person, concern or organization to which I have assigned, granted, conveyed, or licensed or am 
under an obligation under contractor law to assign, grant, convey, or license any rights in the invention 

X □ there is no such person, concern, or organization. 

one M such persons, concerns or organizations are listed in (A) and (B) below: 

(A) FU LL NAME of assignee/licensee/grantee/conveyee* Gendaa Limited 

AnrtRESS 1-3 Burtonhole Lane- Mill Hill. Lond on NW7 1AD. United Kingdom k( „ ATlrtM 

□ INDIVIDUAL El SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

(B) FULL NAME of assignee/licensee/grantee/conveyee* 

□ INDIVIDUAL^ □ SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

•NOTE- Separate verified statement is required from each person, concern or organization 
named in (A) and (B) above having rights to the invention, averring to his/her/its 
status as a small entity. (37 CFR 1.27) 

I ^knowledge the duty to file, in this case, notifi c ation Q f anv change in status resulting in loss of entitlement to small 
erjtjtistahjs prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee due after the 
date on whfch status as a small entity is no longer appropriate. (37 CFR 1 .28(b)) 



1 CHOO Yen _____ 2. 3- 

InA/!E OF INVpifPR NAME OF INVENTOR NAME OF INVENTOR 



'^MEOF INVjpr| 



Signature of Investor Signature of Inventor Signature of Inventor 



Date 



Date Date 
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inventor(s)- CHOP. Yen » Atty OKt 

Appin 09/546.353 or Patent No , PM 273B84/PQ0428US 

t r ' M» ' Client Ref 

Filed September 18, 2000 or issued. 

For NUCLEIC ACID BINDING PROTEINS 



SMALL ENTITY STATEMENT CHIMING SMALL ENTITY 
STATUS (37 CFR 1 9(f) and 1.27 <b)) - INDEPENDENT INVENTOR 

As a oeiow named inventor, i hereby declare mat \ qualify as an independent inventor as defined in 37 CFR 1.9(c) for 
purposes of paying reduced fees under Section 41 (a) and (b) of Title 95, United States Coda, to the Patent and 
Trademark Office with regard to the invention entitled as above and described in 

x □ the specification filed nerewith 

one 63 Application No OS/646 353 filed September 16, 2000 
box □ Patent No. _ . _ . . issued 

l have not assigned, granted, conveyed or licensed and am under no obligation under contract or law to assign, convey 
or license any rights in the invention to any person who could not be classified as an independent inventor under 37 CFR 
1.9(c) if that person had made the invention, or to any concern which would not quality as a small business concern 
under 37 CFR 1 9(a) or a nonprofit organization under 37 CFR 1 .d{*).: 

Eacn fsmau entity) person, concern or organization to whtcn I nave assigned, granted, conveyed, or licensed or am 
unaer sn obligation under contract or law to assign, grant, convey, or license any rights in tne invention 

X □ there is no such person, concern, or organization- 

one -> £3 such persons, concerns or organizations are listed in [Ai and {B} beioi/w: 

{A} FULL NAME of assignee/licenseeVgrantee/conveyee* Gendaq Limited 

ADDRESS 1*3 Buttonhole Lane. Mill Hill London NW7 IAD United Kingdom 

□ INDIVIDUAL E3 SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

(B) Full NAME of assignee/hcensee/grantee/conveyee* 
ADDRESS 

□ INDIVIDUAL □ SMALL BUSINESS CONCERN □ NONPROFIT ORGANIZATION 

"NOTE Separate verified statement is required from each person, concern or organization 
named in (A) and (B) above having rights to tne invention, averring to his/her/its 
status as a small entity (37 CFR 1 ,27) 

I acknowledge the duty id file, in mis case notification of anv change in status resulting in, (oss of entitlement to small 
entity status prior to paying, or at the time of paying, tbe earliest of the issue fee or any maintenance fee due after the 
date on which status as a small entity is no longer appropriate. (37 CFR 1.28(b)) 

1. CHOP. Yen /? ' 2. IS ALAN. Marie 3 



fWW 

Date / ™ 



OF ifivffNTOR - NAME OF INVENTOR NAME OF INVENTOR 

lAW— c^C^ — - 

Signature of Inventor Signatrffe of Inventor Signature of Inventor 



O f 'y*:' 

Date r c Date/ T Date 
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(M#) 

Invention: NUCLEIC ACID BINDING PROTEINS 
Inventor (s): CHOO, Yen 



Piilsbury Madison & Sutro LLP 
Intellectual Property Group 
1 1 00 New York Avenue, NW 
Ninth Floor 

Washington, DC 20005-3918 

Attorneys 
Telephone: (202)861-3000 



This is a: 

□ Provisional Application 

□ Regular Utility Application 

□ Continuing Application 

^ The contents of the parent are incorporated 
by reference 

IE! PCT National Phase Application 

□ Design Application 

□ Reissue Application 

□ Plant Application 

□ Substitute Specification 

Sub. Spec Filed 

in App. No. / 

□ Marked up Specification re 

Sub. Spec, filed 

In App. No / 

SPECIFICATION 



DocumentH 
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Nucleic Acid Bi 

The present invention relates to DNA binding proteins. In particular, the invention 
relates to a method for designing a protein which is capable of binding to a defined 
methylated DNA sequence but not to an equivalent unmethylated DNA sequence. 

Protein-nucleic acid recognition is a commonplace phenomenon which is central to a 
large number of biomolecular control mechanisms which regulate the functioning of 
eukaryotic and prokaryotic cells. For instance, protein-DNA interactions form the basis 
of the regulation of gene expression and are thus one of the subjects most widely studied 
by molecular biologists. 

A wealth of biochemical and structural information explains the details of protein-DNA 
recognition in numerous instances, to the extent that general principles of recognition 
have emerged. Many DNA-binding proteins contain independently folded domains for 
the recognition of DNA, and these domains in turn belong to a large number of structural 
families, such as the leucine zipper, the "heiix-tum-helix" and zinc finger families. 

Despite the great variety of structural domains, the specificity of the interactions observed 
to date between protein and DNA most often derives from the complementarity of the 
surfaces of a protein a-helix and the major groove of DNA [Kiug, (1993) Gene 
135:83-92]. In light of the recurring physical interaction of a-helix and major groove, the 
tantalising possibility arises that the contacts between particular amino acids and DNA 
bases could be described by a simple set of rules; in effect a stereochemical recognition 
code which relates protein primary structure to binding-site sequence preference. 

It is clear, however, that no code will be found which can describe DNA recognition by 
all DNA-binding proteins. The structures of numerous complexes show significant 
differences in the way that the recognition a-helices of DNA-binding proteins from 
different structural families interact with the major groove of DNA, thus precluding 
similarities in patterns of recognition. The majority of known DNA-binding motifs are 
not particularly versatile, and any codes which might emerge would likely describe 
binding to a very few related DNA sequences. 
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Even within each family of DNA-binding proteins, moreover, it has hitherto appeared 
that the deciphering of a code would be elusive. Due to the complexity of the 
protein-DNA interaction, there does not appear to be a simple "alphabetic" equivalence 
5 between the primary structures of protein and nucleic acid which specifies a direct amino 
acid to base relationship. 

International patent application WO 96/06166 addresses this issue and provides a 
"syllabic" code which explains protein-DNA interactions for zinc finger nucleic acid 
10 binding proteins. A syllabic code is a code which relies on more than one feature of the 
binding protein to specify binding to a particular base, the features being combinable in 
the forms of "syllables", or complex instructions, to define each specific contact. 

Our copending UK patent applications, GB 9710805.4, 9710806.2, 9710807.0, 
15 9710808.8, 9710809.6, 9710810.4, 9710811.2 and 9710812.0 describe improved 
techniques for designing zinc finger polypeptides capable of binding desired nucleic acid 
sequences. In combination with selection procedures, such as phage display, set forth for 
example in WO 96/06166, these techniques enable the production of zinc finger 
polypeptides capable of recognising practically any desired sequence. 

20 

Zinc finger domains studied and produced to date are capable of binding to recognition 
sequences composed by any of four nucleic acid bases: A, C, G or T (U in RNA). 
However, the DNA of many organisms includes also a fifth base, 5-methylcytosine 
(5-meC or, in nucleotide sequences herein, M). 5-meC arises from specific methylation 

25 of cytosine, and is used to mark the genome or to increase its information content. 
5-methylcytosine is well known to affect protein-DNA interactions, for instance 
inhibiting cleavage of DNA by certain restriction enzymes. In vertebrates, cytosine is 
frequently methylated when directly preceding guanine, as in the dinucleotide CpG. This 
type of methylation generally down-regulates vertebrate gene expression, and can also 

30 prevent the binding of many eukaryotic transcription factors to DNA. Yet the zinc finger 
transcription factors tested to date, Spl and YY1, are not affected by CpG methylation of 
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their DNA binding sites, suggesting that zinc fingers are incapable of discriminating 
between cytosine and 5-meC. 

Since methylated cytosine bases are involved with many regulatory interactions in gene 
5 expression, and particularly in eukaryotic, including human, gene expression, the 
production of zinc finger polypeptides which specifically target methylated cytosine 
bases would be highly desirable. Such polypeptides, in order to be useful, must be able to 
differentiate DNA sequences in which cytosine is methylated to 5-meC from identical 
non-methylated sequences. 

10 

Further nucleic acid base modifications are known in the art. For example, brominated 
nucleosides are known, such as Br-dU. Being photolabile, brominated nucleosides are 
useful in the determination of DNA-protein complex structure. Br-dU containing 
oligonucleotides are also useful as probes, since antibodies are available which recognise 

15 Br-dU. Moreover, in antisense oligonucleotide chemistry, the use of backbone 
modifications to improve oligonucleotide stability is well known; for example, 
phosphorothioate and 2-0 methylation are commonplace. Such backbone-modified 
nucleosides, and other nucleosides, may also be Co modified. For example, Co propyne 
derivatives and Co methyipyrimidine nucleosides are known and used in antisense 

20 nucleic acid chemistry. 

Specific detection of modified nucleotides, and preferential binding of DNA-binding 
proteins thereto, is desirable. However, agents which are capable of reliably targeting a 
protein to a modified nucleic acid in a sequence-specific manner are not available in the 
25 art. 

Summary of the Invention 

We have now determined that modified nucleosides can be specifically recognised, over 
30 unmodified equivalents, by zinc finger polypeptides in a sequence-dependent manner. 
The invention accordingly provides a method for producing a zinc finger polypeptide 
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which binds to a target nucleic acid sequence containing a modified nucleic acid base, but 
not to an identical sequence containing the equivalent unmodified base. 

In the present invention, a "modified" base is a nucleic acid base other than A, C, G or T 
5 as they occur in DNA in nature. Thus, the term modified includes methylated bases, 
such as 5-meC which occurs naturally in DNA, and base analogues, including naturally- 
occurring analogues such as U and artificial analogues such as I, backbone-modified 
bases and other artificial nucleosides. 

10 In a first embodiment, the invention provides a method for preparing a DNA binding 
polypeptide of the Cys2-His2 zinc finger class capable of binding to a DNA triplet in 
target DNA sequence comprising 5-meC as the central residue in the target DNA triplet, 
wherein binding to the 5-meC residue by an a-heiical zinc finger DNA binding motif of 
the polypeptide is achieved by placing an Ala residue at position +3 of the a-helix of the 

15 zinc finger. 

All of the DNA-binding residue positions of zinc fingers, as referred to herein, are 
numbered from the first residue in the a-helix of the finger, ranging from +1 to +9. 
refers to the residue in the framework structure immediately preceding the a-helix in a 
20 Cys2-His2 zinc finger polypeptide. Residues referred to as "++ " are residues present in 
an adjacent (C-terminal) finger. Where there is no C-terminal adjacent finger, b >-r * 
interactions do not operate. 

Cys2-His2 zinc finger binding proteins, as is well known in the art, bind to target nucleic 
25 acid sequences via a-helical zinc metal atom co-ordinated binding motifs known as zinc 
fingers. Each zinc finger in a zinc finger nucleic acid binding protein is responsible for 
determining binding to a nucleic acid triplet in a nucleic acid binding sequence. 
Preferably, there are 2 or more zinc fingers, for example 2, 3, 4, 5 or 6 zinc fingers, in 
each binding protein. Advantageously, there are 3 zinc fingers in each zinc finger 
30 binding protein. 
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The method of the present invention allows the production of what are essentially 
artificial DNA binding proteins. In these proteins, artificial analogues of ammo acids 
may be used, to impart the proteins with desired properties or for other reasons. Thus, the 
term k 'amino acid", particularly in the context where "any amino acid" is referred to, 
5 means any sort of natural or artificial ammo acid or amino acid analogue that may be 
employed in protein construction according to methods known in the art. Moreover, any 
specific amino acid referred to herein may be replaced by a functional analogue thereof, 
particularly an artificial functional analogue. The nomenclature used herein therefore 
specifically comprises within its scope functional analogues of the defined amino acids. 

10 

The a-helix of a zinc finger binding protein aligns antiparallel to the nucleic acid strand, 
such that the primary nucleic acid sequence is arranged 3' to 5' in order to correspond 
with the N terminal to C-terminal sequence of the zinc finger. Since nucleic acid 
sequences are conventionally written 5' to 3*, and amino acid sequences N-terminus to 

15 C-terminus, the result is that when a nucleic acid sequence and a zinc finger protein are 
aligned according to convention, the primary interaction of the zinc finger is with the - 
strand of the nucleic acid, since it is this strand which is aligned 3' to 5\ These 
conventions are followed in the nomenclature used herein. It should be noted, however, 
that in nature certain fingers, such as finger 4 of the protein GLI, bind to the + strand of 

20 nucleic acid: see Suzuki et al, (1994) NAR 22:3397-3405 and Pavletich and Pabo, (1993) 
Science 261:1701-1707. The incorporation of such fingers into DNA binding molecules 
according to the invention is envisaged. 

The invention provides a solution to a problem hitherto unaddressed in the art, by 
25 permitting the rational design of polypeptides which will bind DNA triplets containing a 
5-meC residue, but not identical triplets containing a C residue. 

The present invention may be integrated with the rules set forth for zinc finger 
polypeptide design in our copending UK patent applications listed above. In a preferred 
30 aspect, therefore, the invention provides a method for preparing a DNA binding 
polypeptide of the Cys2-His2 zinc finger class capable of binding to a DNA triplet in 
target DNA sequence comprising 5-meC, but not to an identical triplet comprising 
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unmethylated C, wherein binding to each base of the triplet by an a-helical zinc finger 
DNA binding motif in the polypeptide is determined as follows: 



a) if the 5' base in the triplet is G, then position +6 in the a-helix is Arg and/or position 
5 4-4-2 is Asp; 

b) if the 5' base in the triplet is A* then position +6 in the a-helix is Gin or Glu and -r+2 
is not Asp; 

c) if the 5' base in the triplet is T, then position +6 in the a-helix is Ser or Thr and 
position +4-2 is Asp; or position 4-6 is a hydrophobic amino acid other than Ala; 

10 d) if the 5' base in the triplet is C, then position +6 in the a-helix may be any amino acid, 
provided that position ++2 in the a-helix is not Asp; 

e) if the central base in the triplet is G, then position 4-3 in the a-helix is His; 

f) if the central base in the triplet is A, then position 4-3 in the a-helix is Asn; 

g) if the central base in the triplet is T, then position 4-3 in the a-helix is Ala, Ser, He, 
15 Leu, Thr or Val; provided that if it is Ala, then one of the residues at -1 or +6 is a small 

residue; 

h) if the central base in the triplet is 5-meC, then position +3 in the a-helix is Ala, Ser, 
He, Leu, Thr or Val; provided that if it is Ala, then one of the residues at -1 or +6 is a 
small residue; 

20 i) if the 3' base in the triplet is G, then position -1 in the a-helix is Arg; 

j) if the 3' base in the triplet is A, then position -1 in the a-helix is Gin and position +2 is 
Ala; 

k) if the 3' base in the triplet is T, then position -1 in the a-helix is Asn; or position -1 is 
Gin and position +2 is Ser; 
25 1) if the 3* base in the triplet is C, then position -1 in the a-helix is Asp and Position +1 
is Arg. 

The foregoing represents a set of rules which permits the design of a zinc finger binding 
protein specific for any given DNA sequence incorporating 5-meC. 

30 

A zinc finger binding motif is a structure well known to those in the art and defined in, 
for example, Miller ex aL, (1985) EMBO J. 4:1609-1614; Berg (1988) PNAS (USA) 
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85:99-102; Lee ex aL, (1989) Science 245:635-637; see International patent applications 
WO 96/06166 and WO 96/32475, corresponding to USSN 08/422,107, incorporated 
herein by reference. 

5 In general, a preferred zinc finger framework has the structure: 

(A) X 0 _ : C X,_, C X 9 _, 4 H X 3 _. V c 

where X is any amino acid, and the numbers in subscript indicate the possible numbers of 
10 residues represented by X. 

In a preferred aspect of the present invention, zinc finger nucleic acid binding motifs may 
be represented as motifs having the following primary structure: 

15 (B) X a C X 2 _ 4 C X 2 _, FX c XXXXLXXHXXX b H~ linker 

-1 123456789 

wherein X (including X a , X b and X c ) is any amino acid. X2-4 and X2-3 refer to the 
presence of 2 or 4, or 2 or 3. amino acids, respectively. The Cys and His residues, which 
20 together co-ordinate the zinc metal atom, are marked in bold text and are usually 
invariant, as is the Leu residue at position +4 in the cc-helix. 

Modifications to this representation may occur or be effected without necessarily 
abolishing zinc finger function, by insertion, mutation or deletion of amino acids. For 

25 example it is known that the second His residue may be replaced by Cys (Krizek el al, 
(1991) J. Am. Chem. Soc. 113:4518-4523) and that Leu at +4 can in some 
circumstances be replaced with Arg. The Phe residue before Xc may be replaced by any 
aromatic other than Trp. Moreover, experiments have shown that departure from the 
preferred structure and residue assignments for the zinc finger are tolerated and may even 

30 prove beneficial in binding to certain nucleic acid sequences. Even taking this into 
account, however, the general structure involving an a-heiix co-ordinated by a zinc atom 
which contacts four Cys or His residues, does not alter. As used herein, structures (A) 
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and (B) above are taken as an exemplary structure representing ali zinc finger structures 
of the Cys2-His2 type. 



Preferably, X a is F / Y -X or P- F / Y -X. In this context, X is any amino acid. Preferably, in 
5 this context X is E, K, T or S. Less preferred but also envisaged are Q, V, A and P. The 
remaining amino acids remain possible. 

Preferably, X 2 -4 consists of two amino acids rather than four. The first of these amino 
acids may be any amino acid, but S, E. K, T, P and R are preferred. Advantageously, it is 
10 P or R. The second of these amino acids is preferably E, although any amino acid may be 
used. 

Preferably, X b is T or L 

15 Preferably, X c is S or T. 

Preferably, X 2 -3 is G-K-A, G-K-C, G-K-S or G-K-G. However, departures from the 
preferred residues are possible, for example in the form of M-R-N or M-R. 

20 Preferably, the linker is T-G-E-K or T-G-E-K-P. 

As set out above, the major binding interactions occur with amino acids -1, +3 and +6. 
Amino acids +4 and +7 are largely invariant. The remaining amino acids may be 
essentially any amino acids. Preferably, position +9 is occupied by Arg or Lys. 
25 Advantageously, positions +1, +5 and +8 are not hydrophobic amino acids, that is to say 
are not Phe, Trp or Tyr. Preferably, position +4-2 is any amino acid, and preferably 
serine, save where its nature is dictated by its role as a ++2 amino acid for an N-terminal 
zinc finger in the same nucleic acid binding molecule. 

30 In a most preferred aspect, therefore, bringing together the above, the invention allows 
the definition of every residue in a zinc finger DNA binding motif which will bind 
specifically to a given DNA triplet incorporating a 5-meC residue as the central residue in 
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the triplet. Where targeting of a 5-meC containing sequence is desired, therefore, a 
suitable zinc finger can be constructed selecting a binding site such that 5-meC occurs at 
the centre of at least one base triplet thereof. 

5 The code provided by the present invention is not entirely rigid; certain choices are 
provided. For example, positions +1, +5 and +8 may have any amino acid allocation, 
whilst other positions may have certain options: for example, the present rules provide 
that, for binding to a central T residue, any one of Ala, Ser or Val may be used at +3. In 
its broadest sense, therefore, the present invention provides a very large number of 

10 proteins which are capable of binding to every defined target DNA triplet incorporating 
5-meC as the central residue and thereby any DNA binding site incorporating 5-meC 

Preferably, however, the number of possibilities may be significantly reduced. For 
example, the non-critical residues +1, +5 and +8 may be occupied by the residues Lys, 
15 Thr and Gin respectively as a default option. In the case of the other choices, for 
example, the first-given option may be employed as a default. Thus, the code according 
to the present invention allows the design of a single, defined polypeptide (a "default" 
polypeptide) which will bind to its target triplet. 

20 In a further aspect of the present invention, there is provided a method for preparing a 
DNA binding protein of the Cys2-His2 zinc finger class capable of binding to a target 
DNA sequence incorporating 5-meC, comprising the steps of: 

a) selecting a model zinc finger domain from the group consisting of naturally occurring 
25 zinc fingers and consensus zinc fingers; and 

b) mutating at least one of positions -1, +3, +6 (and ++2) of the finger as required by a 
method according to the present invention. 

30 In general, naturally occurring zinc fingers may be selected from those fingers for which 
the DNA binding specificity is known. For example, these may be the fingers for which a 
crystal structure has been resolved: namely Zif 268 (Elrod-Erickson et aL, (1996) 
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Structure 4:1171-1180), GLI (Pavletich and Pabo, (1993) Science 261:1701-1707), 
Tramtrack (Fairall ex aL (1993) Nature 366:483-487) and YY1 (Houbaviy"^ aL (1996) 
PNAS (USA) 93:13577-13582). 

5 The naturally occurring zinc finger 2 in Zif 268 makes an excellent starting point from 
which to engineer a zinc finger and is preferred. 

Consensus zinc finger structures may be prepared by comparing the sequences of known 
zinc fingers, irrespective of whether their binding domain is known. Preferably, the 
10 consensus structure is selected from the group consisting of the consensus structure P Y 
KCPECGKSFSQKSDLVKHQRTHTG, and the consensus structure P Y K 
CSECGKAFSQKSNLTRHQRIHTGEKR 

The consensuses are derived from the consensus provided by Krizek ex aL, (1991) J. Am. 
15 Chem. Soc. 113:4518-4523 and from Jacobs, (1993) PhD thesis, University of 
Cambridge, UK, In both cases, the linker sequences described above for joining two zinc 
finger motifs together, namely TGEK or TGEKP can be formed on the ends of the 
consensus. Thus, a P may be removed where necessary, or, in the case of the consensus 
terminating TG,EK (P) can be added. 

20 

When the nucleic acid specificity of the model finger selected is known, the mutation of 
the finger in order to modify its specificity to bind to the target DNA may be directed to 
residues known to affect binding to bases at which the natural and desired targets differ. 
Otherwise, mutation of the model fingers should be concentrated upon residues -1, +3, +6 
25 and 4-+2 as provided for in the foregoing rules. 

In order to produce a binding protein having improved binding, moreover, the rules 
provided by the present invention may be supplemented by physical or virtual modelling 
of the protein/DNA interface in order to assist in residue selection. 

30 



mm 



ii i npw 1 1 "mm m mi r 1 m 



WO 99/47656 ^ PCT/GB99/00816 

In a second embodiment, the invention provides a method for producing a zinc finger 
polypeptide capable of binding to a DNA sequence comprising a modified* residue, bui 
not to an identical sequence comprising an equivalent unmodified residue, comprising: 

5 a) providing a nucleic acid library encoding a repertoire of 2inc finger polypeptides, 
the nucleic acid members of the library being at least partially randomised at one or more 
of the positions encoding residues -1, 2, 3 and 6 of the a-helix of the zinc finger 
polypeptides; 

10 b) displaying the library in a selection system and screening it against a target DNA 
sequence comprising the modified residue; 

c) isolating the nucleic acid members of the library encoding zinc finger 
polypeptides capable of binding to the target sequence; and 

15 

d) optionally, verifying that the zinc finger polypeptides do not bind significantly to 
a DNA sequence identical to the target DNA sequence but containing the unmodified 
residue in place of the modified residue. 

20 Methods for the production of libraries encoding randomised polypeptides are known in 
the art and may be applied in the present invention. Randomisation may be total, or 
partial; in the case of partial randomisation, the selected codons preferably encode 
options for amino acids as set forth in the rules of the first embodiment of the present 
invention. Thus, the first and second embodiments may advantageously be combined. 

25 

Preferably, the modified residue is 5-meC and the unmodified residue is C However, 
other modifications may be targeted by the method of the invention. For example, zinc 
finger polypeptides may be designed which specifically bind to nucleic acids 
incorporating the base U, in preference to the equivalent base T. An advantage of the 
30 second embodiment of the invention is that zinc finger polypeptides may be developed to 
bind to any DNA sequence incorporating a modified base, irrespective of its positioning 
in the target DNA triplet. 
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In a further preferred aspect, the invention comprises a method for producing a zinc 
finger polypeptide capable of binding to a DNA sequence comprising a modified residue, 
but not to an identical sequence comprising an equivalent unmodified residue, 
5 comprising: 

a) providing a nucieic acid library encoding a repertoire of zinc finger polypeptides 
each possessing more than one zinc fingers, the nucleic acid members of the library being 
at least partially randomised at one or more of the positions encoding residues -L, 2, 3 and 

10 6 of the a-helix in a first zinc finger and at one or more of the positions encoding residues 
-L 2> 3 and 6 of the a-helix in a further zinc finger of the zinc finger polypeptides; 

b) displaying the library in a selection system and screening it against a target DNA 
sequence comprising the modified residue; 

15 

c) isolating the nucleic acid members of the library encoding zinc finger 
polypeptides capable of binding to the target sequence; and 

d) optionally, verifying that the zinc finger polypeptides do not bind significantly to 
20 a DNA sequence identical to the target DNA sequence but containing the unmodified 

residue in place of the modified residue. 

In this aspect, the invention encompasses library technology described in our copending 
International patent application WO98/53057, incorporated herein by reference in its 
25 entirety. WO98/53057 describes the production of zinc finger polypeptide libraries in 
which each individual zinc finger polypeptide comprises more than one, for example two 
or three, zinc fingers; and wherein within each polypeptide partial randomisation occurs 
in at least two zinc fingers. 

30 This allows for the selection of the "overlap" specificity, wherein, within each triplet, the 
choice of residue for binding to the third nucleotide (read 3' to 5' on the + strand) is 
influenced by the residue present at position +2 on the subsequent zinc finger, which 
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displays cross-strand specificity in binding. The selection of zmc finger polypeptides 
incorporating cross-strand specificity of adjacent zinc fingers enables the selection of 
nucleic acid binding proteins with a higher degree of specificity than is otherwise 
possible. 

5 

Advantageously, in order to derive the greatest benefit, the binding site is selected such 
that the modified base is in position 3 of one of the triplets, such that cross-strand 
specificity can be relied upon to contact the parallel strand in the corresponding position 
and introduce a further level of discrimination. 

10 

In a third embodiment, the present invention may be applied to the production of zmc 
finger polypeptides capable of binding to a DNA sequence comprising an unmethylated C 
residue, but not to an identical sequence comprising a 5-meC residue. This may be 
carried out by differential screening, as set forth above. Moreover, rules may be applied 
1 5 in addition to or instead of screening. 

Where the central residue of a target triplet is C, the use of Asp at position 4-3 of a zinc 
finger polypeptide allows preferential binding to C over 5-meC 

20 

Brief Description of the Figures 

Figure la is an alignment of the amino acid sequence of the three fingers from Zif268 
used in a phage display library. Randomised residue positions in the a-helix of finger 2 
25 are marked 'X' and are numbered above the alignment relative to the first helical residue 
(position +1). Residues which form the hydrophobic core are circled; zinc ligands are 
written as white letters on a black circle background; and positions comprising the 
secondary structure elements of a zinc finger are marked below the sequence. 

30 Figure lb shows amino acid sequences of the variant a-helical regions from some zinc 
fingers selected by phage display using the DNA binding site GCGGNGGCG where the 
central (bold) nucleotide of the middle (underlined) triplet was either: (i) 5- 
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methylcytosine, (ii) thymine, or (iii) cytosine. Amino acid sequences axe listed below the 
DNA oligonucleotide used in their selection, Amino acid positions are numbered above 
the aligned sequences relative to the first helical residue (position +1). Circled residues 
(in position +3) are predicted to contact the middle nucleotide of the binding sue. 

5 

Figure 1c shows a phage ELISA binding assay showing discrimination of pynmidines by 
representative phage-selected zinc fingers. The matrix shows three different zinc finger 
phage clones (x, y and z) reacted with four different DNA binding sites present at a 
concentration of 3nM. Binding is represented by vertical bars which indicate the OD 
10 obtained by ELISA (Choo and Klug, (1997) Curr. Opim Str. Biol. 7:117-125). The 
amino acid sequences of the variant ohelical regions from the selected zinc fingers are; 
REDVLIRHGK (x), RADALMVHKR (y), and RGPDLARHGR (z). The DNA 
sequences contain the generic binding site GCGGNGGCG, where the central (bold) 
nucleotide was either: uracil (U), thymine (T), cytosine (C), or 5-methylcytosine (M).- 

15 

Figure 2 shows the effect of cytosine methylation on DNA binding by phage-selected 
zinc fingers. Graphs show three different zinc finger phage binding to the DNA sequence 
GCGGCGGCG in the presence (circle) and absence (triangle) of methylation of the 
central base (bold). The zinc finger clones tested contained variant a-helical regions of 
20 the middle finger as follows: (a) RADALMVHKR, (b) RGPDLARHGR and (c) 
REDVLIRHGK. These respective zinc finger clones preferentially bind their cognate 
DNA site in the presence, absence, or regardless of cytosine methylation. 

Figure 3 shows the binding site interactions of 5 zinc finger polypeptides, selected taking 
25 into account cross-strand specificity by overlapping finger randomisation, with each of 
the oligonucleotides used in the selection process. Cross-strand contacts are shown. 

Figure 4 is analogous to Figure 2 and shows the binding curves for four of the 
polypeptides as described in Figure 3 to their respective oligonucleotides. 

30 

Figure 5 shows discrimination between 5-meC and T by zfHAE(M). 
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Figure 6 shows binding of zmc finger polypeptides zfHHAfM) and zfHAE(M) to a 
nucleotide sequence (Figure 6a) m response to selective methylation by addition of 
methylase enzymes (Figure 6b). Polypeptides zfHHA(Y) and zfHAE(Y) do not 
discnminate between methylated and unmethylated DNA. as expected. 

5 

Detailed Description of the Invention 

Randomisation involves may involve of zinc finger polypeptides at the DNA or protein 
10 level. Mutagenesis and screening of zinc finger polypeptides may be achieved by any 
suitable means. Preferably, the mutagenesis is performed at the nucleic acid level, for 
example by synthesising novel genes encoding mutant proteins and expressing these to 
obtain a variety of different proteins. Alternatively, existing genes can be themselves 
mutated, such by site-directed or random mutagenesis, in order to obtain the desired 
15 mutant genes. 

Mutations may be performed by any method known to those of skill in the art. Preferred, 
however, is site-directed mutagenesis of a nucleic acid sequence encoding the protein of 
interest. A number of methods for site-directed mutagenesis are known in the art, from 
20 methods employing single-stranded phage such as Ml 3 to PCR-based techniques (see 
"PCR Protocols: A guide to methods and applications", M.A. Innis, D.H. Gelfand, J.J. 
Sninsky, T.J. White (eds.)- Academic Press, New York, 1990). Preferably, the 
commercially available Altered Site II Mutagenesis System (Promega) may be employed, 
according to the directions given by the manufacturer. 

25 

Randomisation of the zinc finger binding motifs produced according to the invention is 
preferably directed to those residues where the code provided herein gives a choice of 
residues. For example, therefore, positions +1, +5 and +8 are advantageously 
randomised, whilst preferably avoiding hydrophobic amino acids; positions involved in 
30 binding to the nucleic acid, notably -1, +2, +3 and +-6, may be randomised also, 
preferably within the choices provided by the rules of the present invention. 
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Screening of the proteins produced by mutant genes is preferably performed by 
expressing the genes and assaying the binding ability of the protein product. A simple 
and advantageously rapid method by which this may be accomplished is by phage 
display, in which the mutant polypeptides are expressed as fusion proteins with the coat 

5 proteins of filamentous bacteriophage, such as the minor coat protein pll of bacteriophage 
ml3 or gene III of bacteriophage Fd, and displayed on the capsid of bactenophage 
transformed with the mutant genes. The target nucleic acid sequence is used as a probe to 
bind directly to the protein on the phage surface and select the phage possessing 
advantageous mutants, by affinity purification. The phage are then amplified by passage 

10 through a bacterial host, and subjected to further rounds of selection and amplification in 
order to enrich the mutant pool for the desired phage and eventually isolate the preferred 
clone(s). Detailed methodology for phage display is known in the art and set forth, for 
example, in US Patent 5,223,409; Choo and Klug, (1995) Current Opinions -in 
Biotechnology 6:431-436; Smith, (1985) Science 228:1315-1317; and McCafferty et aL y 

15 (1990) Nature 348:552-554; all incorporated herein by reference. Vector systems and kits 
for phage display are available commercially, for example from Pharmacia. 

Specific peptide ligands such as zinc finger polypeptides may moreover be selected for 
binding to targets by affinity selection using large libraries of peptides linked to the C 
20 terminus of the lac repressor Lacl (Cull et aU (1992) Proc Natl Acad Sci USA, 89, 1865- 
9). When expressed in E. coli the repressor protein physically links the ligand to the 
encoding plasmid by binding to a lac operator sequence on the plasmid. 

An entirely in vitro polysome display system has also been reported (Mattheakis et aL, 
25 (1994) Proc Natl Acad Sci U S A, 91, 9022-6) in which nascent peptides are physically 
attached via the ribosome to the RNA which encodes them. 

The library of the invention may randomised at those positions for which choices are 
given in the rules of the first embodiment of the present invention. In particular, the 
30 members of the library are randomised at position +3 for binding to a central 5-meC 
residue. In such a case, 5-meC binding polypeptides will be selected by comparative 
binding analyses against methylated and non-methylated binding sites. However, the 
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rules set forth above allow the person of ordinary skill in the an to make informed 
choices concerning the desired codon usage at the given positions. For instance, position 
+3 in the case of a central 5-meC residue should be Ala residue, encoded by the codon 
GCN. 

5 

Zinc finger binding motifs designed according to the invention may be combined into 
nucleic acid binding proteins having a multiplicity of zinc fingers. Preferably, the 
proteins have at least two zinc fingers. In nature, zinc finger binding proteins commonly 
have at least three zinc fingers, although two- zinc finger proteins such as Tramtrack are 

10 known. The presence of at least three zinc fingers is preferred. Binding proteins may be 
constructed by joining the required fingers end to end, N-terminus to C-terminus. 
Preferably, this is effected by joining together the relevant nucleic acid coding sequences 
encoding the zinc fingers to produce a composite coding sequence encoding the entire 
binding protein. The invention therefore provides a method for producing a DNA 

15 binding protein as defined above, wherein the DNA binding protein is constructed by 
recombinant DNA technology, the method comprising the steps of: 



a) preparing a nucleic acid coding sequence encoding two or more zinc finger binding 

motifs as defined above, placed N-terminus to C-terminus; 
20 b) inserting the nucleic acid sequence into a suitable expression vector; and 

c) expressing the nucleic acid sequence in a host organism in order to obtain the DNA 

binding protein. 



A "leader" peptide may be added to the N-terminal finger. Preferably, the leader peptide 
25 is MAEEKP. 



The nucleic acid encoding the DNA binding protein according to the invention can be 
incorporated into vectors for further manipulation. As used herein, vector (or plasmid) 
refers to discrete elements that are used to introduce heterologous nucleic acid into cells 
30 for either expression or replication thereof. Selection and use of such vehicles are well 
within the skill of the person of ordinary skill in the art. Many vectors are available, and 
selection of appropriate vector will depend on the intended use of the vector, i.e. whether 
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it is to be used for DNA amplification or for nucleic acid expression, the size of the DNA 
to be inserted into the vector, and the host cell to be transformed with the vector. Each 
vector contains various components depending on its function (amplification of DNA or 
expression of DNA) and the host cell for which it is compatible. The vector components 
5 generally include, but are not limited to, one or more of the following: an origin of 
replication, one or more marker genes, an enhancer element, a promoter, a transcription 
termination sequence and a signal sequence. 



Both expression and cloning vectors generally contain nucleic acid sequence that enable 
10 the vector to replicate in one or more selected host ceils. Typically in cloning vectors, 
this sequence is one that enables the vector to replicate independently of the host 
chromosomal DNA, and includes origins of replication or autonomously replicating 
sequences. Such sequences are well known for a variety of bacteria, yeast and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative 
15 bacteria, the 2\x plasmid origin is suitable for yeast, and various viral origins (e.g. SV 40, 
polyoma, adenovirus) are useful for cloning vectors in mammalian cells. Generally, the 
origin of replication component is not needed for mammalian expression vectors unless 
these are used in mammalian cells competent for high level DNA replication, such as 
COS cells, 

20 

Most expression vectors are shuttle vectors, i.e. they are capable of replication in at least 
one class of organisms but can be transfected into another class of organisms for 
expression. For example, a vector is cloned in E. coli and then the same vector is 
transfected into yeast or mammalian cells even though it is not capable of replicating 

25 independently of the host cell chromosome. DNA may also be replicated by insertion 
into the host genome. However, the recovery of genomic DNA encoding the DNA 
binding protein is more complex than that of exogenously replicated vector because 
restriction enzyme digestion is required to excise DNA binding protein DNA. DNA can 
be amplified by PCR and be directly transfected into the host ceils without any replication 

30 component. 
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Advantageously, an expression and cloning vector may contain a selection gene also 
referred to as selectable marker. This gene encodes a protein necessary for the survival or 
growth of transformed host cells grown in a selective culture medium. Host cells not 
transformed with the vector containing the selection gene will not survive m the culture 
5 medium. Typical selection genes encode proteins that confer resistance to antibiotics and 
other toxins, e.g. ampicillin. neomycin, methotrexate or tetracycline, complement 
auxotrophic deficiencies, or supply critical nutrients not available from complex media. 

As to a selective gene marker appropriate for yeast, any marker gene can be used which 
10 facilitates the selection for transformants due to the phenotypic expression of the marker 
gene. Suitable markers for yeast are, for example, those conferring resistance to 
antibiotics G418, hygromycin or bleomycin, or provide for prototrophy in an auxotrophic 
yeast mutant, for example the URA3, LEU2, LYS2, TRP1, or HIS3 gene. 

15 Since the replication of vectors is conveniently done in E. coli, an E. coli genetic marker 
and an E. coli origin of replication are advantageously included. These can be obtained 
from £. coli plasmids, such as pBR322, Bluescript© vector or a pUC plasmid, e.g. 
pUC18 or pUC19, which contain both E. coli replication origin and £ coli genetic 
marker conferring resistance to antibiotics, such as ampicillin. 

20 

Suitable selectable markers for mammalian cells are those that enable the identification of 
cells competent to take up DNA binding protein nucleic acid, such as dihydrofolate 
reductase (DHFR, methotrexate resistance), thymidine kinase, or genes conferring 
resistance to G418 or hygromycin. The mammalian cell transformants are placed under 

25 selection pressure which only those transformants which have taken up and are 
expressing the marker are uniquely adapted to survive. In the case of a DHFR or 
glutamine synthase (GS) marker, selection pressure can be imposed by culturing the 
transformants under conditions in which the pressure is progressively increased, thereby 
leading to amplification (at its chromosomal integration site) of both the selection gene 

30 and the linked DNA that encodes the DNA binding protein. Amplification is the process 
by which genes in greater demand for the production of a protein critical for growth, 
together with closely associated genes which may encode a desired protein, are reiterated 
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Increased quantities of desired 

Expression and cloning vectors usually contain a promoter that is recognised by the host 
organism and is operably linked to DNA binding protein encoding nucleic acid. Such a 
promoter may be inducible or constitutive. The promoters are operably linked to DNA 
encoding the DNA binding protein by removing the promoter from the source DNA by 
restriction enzyme digestion and inserting the isolated promoter sequence into the vector. 
Both the native DNA binding protein promoter sequence and many heterologous 
promoters may be used to direct amplification and/or expression of DNA binding protein 
encoding DNA. 

Promoters suitable for use with prokaryotic hosts include, for example, the ^-lactamase 
and lactose promoter systems, alkaline phosphatase, the tryptophan (trp) promoter system 
and hybrid promoters such as the tac promoter. Their nucleotide sequences have been 
published, thereby enabling the skilled worker operably to ligate them to DNA encoding 
DNA binding protein, using linkers or adapters to supply any required restriction sites. 
Promoters for use in bacterial systems will also generally contain a Shine-Delgarno 
sequence operably linked to the DNA encoding the DNA binding protein. 

Preferred expression vectors are bacterial expression vectors which comprise a promoter 
of a bacteriophage such as phagex or 11 which is capable of functioning in the bacteria. 
In one of the most widely used expression systems, the nucleic acid encoding the fusion 
protein may be transcribed from the vector by T7 RNA polymerase (Studier et al, 
Methods in Enzymol. 185; 60-89, 1990). In the E. coli BL21(DE3) host strain, used in 
conjunction with pET vectors, the T7 RNA polymerase is produced from the X-lysogen 
DE3 in the host bacterium, and its expression is under the control of the IPTG inducible 
lac UV5 promoter. This system has been employed successfully for over-production of 
many proteins. Alternatively the polymerase gene may be introduced on a lambda phage 
by infection with an int- phage such as the CE6 phage which is commercially available 
(Novagen, Madison, USA), other vectors include vectors containing the lambda PL 
promoter such as PLEX (Invitrogen, NL) , vectors containing the trc promoters such as 
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in tandem within the chromosomes of recombinant cells, 
protein are usually synthesised from thus amplified DNA. 
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pTrcHisXpressTm (Invitrogen) or pTrc99 (Pharmacia Biotech, SE) or vectors containing 
the tac promoter such as pKK223-3 (Pharmacia Biotech) or PMAL (New England 
Biolabs, MA, USA). 

5 Moreover, the DNA binding protein gene according to the invention preferably includes a 
secretion sequence in order to facilitate secretion of the polypeptide from bacterial hosts, 
such that it will be produced as a soluble native, peptide rather than in an inclusion body 
The peptide may be recovered from the bacterial periplasmic space, or the culture 
medium, as appropriate. 

10 

Suitable promoting sequences for use with yeast hosts may be regulated or constitutive 
and are preferably derived from a highly expressed yeast gene, especially a 
Saccharomyces cerevisiae gene. Thus, the promoter of the TRP1 gene, the ADHL or 
ADHII gene, the acid phosphatase (PH05) gene, a promoter of the yeast mating 

15 pheromone genes coding for the a- or a-f actor or a promoter derived from a gene 
encoding a glycolytic enzyme such as the promoter of the enoiase, 
glyceraldehyde-3-phosphate dehydrogenase (GAP), 3-phospho glycerate kinase (PGK), 
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate 
isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triose phosphate isomerase, 

20 phosphoglucose isomerase or glucokinase genes, or a promoter from the TATA binding 
protein (TBP) gene can be used. Furthermore, it is possible to use hybrid promoters 
comprising upstream activation sequences (UAS) of one yeast gene and downstream 
promoter elements including a functional TATA box of another yeast gene, for example a 
hybrid promoter including the UAS(s) of the yeast PH05 gene and downstream promoter 

25 elements including a functional TATA box of the yeast GAP gene (PH05-GAP hybrid 
promoter). A suitable constitutive PH05 promoter is e.g. a shortened acid phosphatase 
PH05 promoter devoid of the upstream regulatory elements (UAS) such as the PH05 
(-173) promoter element starting at nucleotide -173 and ending at nucleotide -9 of the 
PH05 gene. 

30 

DNA binding protein gene transcription from vectors in mammalian hosts may be 
controlled by promoters derived from the genomes of viruses such as polyoma virus, 
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adenovirus, fowlpox virus, bovine papilloma virus, avian sarcoma virus, cytomegalovirus 
(CMV), a retrovirus and Simian Virus 40 (SV40), from heterologous mammalian 
promoters such as the actin promoter or a very strong promoter, e.g. a ribosomal protein 
promoter, and from the promoter normally associated with DNA binding protein 
5 sequence, provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding DNA binding protein by higher eukaryotes may be 
increased by inserting an enhancer sequence into the vector. Enhancers are relatively 
orientation and position independent. Many enhancer sequences are known from 
10 mammalian genes (e.g. elastase and globin). However, typically one will employ an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late 
side of the replication origin (bp 100-270) and the CMV early promoter enhancer. The 
enhancer may be spliced into the vector at a position 5' or 3' to DNA binding protein 
DNA, but is preferably located at a site 5' from the promoter. 

15 

Advantageously, a eukaryotic expression vector encoding a DNA binding protein 
according to the invention may comprise a locus control region (LCR). LCRs are capable 
of directing high-level integration site independent expression of transgenes integrated 
into host cell chromatin, which is of importance especially where the DNA binding 
20 protein gene is to be expressed in the context of a permanently-transfected eukaryotic cell 
line in which chromosomal integration of the vector has occurred, or in transgenic 
animals. 

Eukaryotic vectors may also contain sequences necessary for the termination of 
25 transcription and for stabilising the mRNA. Such sequences are commonly available 
from the 5' and 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These 
regions contain nucleotide segments transcribed as polyadenylated fragments in the 
untranslated portion of the mRNA encoding DNA binding protein. 

30 An expression vector includes any vector capable of expressing DNA binding protein 
nucleic acids that are operativety linked with regulatory sequences, such as promoter 
regions, that are capable of expression of such DNAs. Thus, an expression vector refers 
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to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus 
or other vector, that upon introduction into an appropriate host cell, results in expression 
of the cloned DNA. Appropriate expression vectors are well known to those with 
ordinary skill in the art and include those that are replicable in eukaryotic and/or 
5 prokaryouc cells and those that remain episomal or those which integrate into the host 
ceil genome. For example. DNAs encoding DNA binding protein may be inserted into a 
vector suitable for expression of cDNAs in mammalian cells, e.g. a CMV enhancer-based 
vector such as pEVRF (Matthias, et al., (1989) NAR 17, 6418). 

10 Particularly useful for practising the present invention are expression vectors that provide 
for the transient expression of DNA encoding DNA binding protein in mammalian cells. 
Transient expression usually involves the use of an expression vector that is able to 
replicate efficiently in a host cell, such that the host cell accumulates many copies of the 
expression vector, and, in turn, synthesises high levels of DNA binding protein. For the 

15 purposes of the present invention, transient expression systems are useful e.g. for 
identifying DNA binding protein mutants, to identify potential phosphorylation sites, or 
to characterise functional domains of the protein- 
Construction of vectors according to the invention employs conventional ligation 

20 techniques. Isolated plasmids or DNA fragments are cleaved, tailored, and religated in 
the form desired to generate the plasmids required. If desired, analysis to confirm correct 
sequences in the constructed plasmids is performed in a known fashion. Suitable 
methods for constructing expression vectors, preparing in vitro transcripts, introducing 
DNA into host cells, and performing analyses for assessing DNA binding protein 

25 expression and function are known to those skilled in the art. Gene presence, 
amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of 
mRNA, dot blotting (DNA or RNA analysis), or in situ hybridisation, using an 
appropriately labelled probe which may be based on a sequence provided herein. Those 

30 skilled in the art will readily envisage how these methods may be modified, if desired. 
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In accordance with another embodiment of the present invention, there are provided cells 
containing the above-described nucleic acids. Such host cells such as prokaryote, yeast 
and higher eukaryote cells may be used for replicating DNA and producing the DNA 
binding protein. Suitable prokaryotes include eubacteria, such as Gram-negative or 
Gram-positive organisms, such as E. colL e.g. E. coli K-12 strains, DH5a and HB1Q1, 
or Bacilli. Further hosts suitable for the DNA binding protein encoding vectors include 
eukaryotic microbes such as filamentous fungi or yeast, e.g. Saccharomyces cerevisiae. 
Higher eukaryotic cells include insect and vertebrate cells, particularly mammalian cells 
including human cells, or nucleated cells from other multicellular organisms. In recent 
years propagation of vertebrate cells in culture (tissue culture) has become a routine 
procedure. Examples of useful mammalian host cell lines are epithelial or Fibroblastic 
cell lines such as Chinese hamster ovary (CHO) cells, NIH 3T3 cells, HeLa cells or 293T 
cells. The host cells referred to in this disclosure comprise cells in in vitro culture as well 
as cells that are within a host animal. 

DNA may be stably incorporated into cells or may be transiently expressed using 
methods known in the art. Stably transfected mammalian cells may be prepared by 
transfecting cells with an expression vector having a selectable marker gene, and growing 
the transfected cells under conditions selective for cells expressing the marker gene. To 
prepare transient transfectants, mammalian cells are transfected with a reporter gene to 
monitor transfection efficiency. 

To produce such stably or transiently transfected cells, the cells should be transfected 
with a sufficient amount of the DNA binding protein-encoding nucleic acid to form the 
DNA binding protein. The precise amounts of DNA encoding the DNA binding protein 
may be empirically determined and optimised for a particular cell and assay. 

Host cells are transfected or, preferably, transformed with the above-captioned expression 
or cloning vectors of this invention and cultured in conventional nutrient media modified 
as appropriate for inducing promoters, selecting transformants, or amplifying the genes 
encoding the desired sequences. Heterologous DNA may be introduced into host cells by 
any method known in the an, such as transfection with a vector encoding a heterologous 
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DNA by the calcium phosphate coprecipitation technique or by electroporation. 
Numerous methods of transfection are known to the skilled worker in the field. 
Successful transfection is generally recognised when any indication of the operation of 
this vector occurs in the host cell. Transformation is achieved using standard techniques 
5 appropriate to the particular host cells used. 

Incorporation of cloned DNA into a suitable expression vector, transfection of eukaryotic 
cells with a plasmid vector or a combination of plasmid vectors, each encoding one or 
more distinct genes or with linear DNA, and selection of transfected cells are well known 
10 in the art (see, e.g. Sambrook et al (1989) Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor Laboratory Press). 

Transfected or transformed cells are cultured using media and culturing methods known 
in the art, preferably under conditions, whereby the DNA binding protein encoded by the 
15 DNA is expressed. The composition of suitable media is known to those in the art, so 
that they can be readily prepared. Suitable culturing media are also commercially 
available. 

DNA binding proteins according to the invention may be employed in a wide variety of 
20 applications, including diagnostics and as research tools. Advantageously, they may be 
employed as diagnostic tools for identifying the presence of modified nucleic acid 
molecules in a complex mixture. DNA binding molecules according to the invention can 
differentiate single base modifications in target DNA molecules. 

25 For example, zinc fingers may be fused to nucleic acid cleavage moieties, such as the 
catalytic domain of a restriction enzyme, to produce a restriction enzyme capable of 
cleaving only methylated DNA (see Kim, et al, (1996) Proc. Natl. Acad. Sci. USA 
93:1156-1160) . Using such approaches, different zinc finger domains can be used to 
create restriction enzymes with any desired recognition nucleotide sequence, but which 

30 cleave DNA conditionally dependent on the particular modification of the nucleotides, for 
instance methylation of the cytosine ring at position 5. 
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5-meC targeting zinc fingers may moreover be employed in the regulation of gene 
transcription, for example by specific cleavage of methylated (or unmethylated) 
sequences using a fusion polypeptide comprising a zinc finger targeting domain and a 
DNA cleavage domain, or by fusion of an activating domain (such as HSV VP 16) to a 
5 zinc finger, to activate transcription from a gene which possesses the zinc finger binding 
sequence in its upstream sequences. Activation only occurs when the target DNA is 
modified, such as by methylation. Zinc fingers capable of differentiating between U and 
T may be used to preferentially target RNA or DNA. as required. Where RNA-targeting 
polypeptides are intended, these are included in the term "DNA-binding molecule". 

10 

In a preferred embodiment, the zinc finger polypeptides of the invention may be 
employed to detect the presence of a particular base modification in a target nucleic acid 
sequence in a sample. 

15 Accordingly, the invention provides a method for determining the presence of a target 
modified nucleic acid molecule, comprising the steps of: 

a) preparing a DNA binding protein by the method set forth above which is specific for 

the target modified nucleic acid molecule; 
20 b) exposing a test system comprising the target modified nucleic acid molecule to the 

DNA binding protein under conditions which promote binding, and removing any 

DNA binding protein which remains unbound; 
c) detecting the presence of the DNA binding protein in the test system. 

25 In a preferred embodiment, the DNA binding molecules of the invention can be 
incorporated into an ELISA assay. For example, phage displaying the molecules of the 
invention can be used to detect the presence of the target DNA, and visualised using 
enzyme-linked anti-phage antibodies. 

30 Further improvements to the use of zinc finger phage for diagnosis can be made, for 
example, by co-expressing a marker protein fused to the minor coat protein (gVIII) of 
bacteriophage. Since detection with an anti-phage antibody would then be obsolete, the 
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time and cost of each diagnosis would be further reduced. Depending on the 
requirements, suitable markers for display might include the fluorescent proteins ( A. B. 
Cubitt, eial.. (1995) Trends Biochem Sa. 20, 448-455; T. T. Yang, et al.. (1996) Gene 
173, 19-23), or an enzyme such as alkaline phosphatase which has been previously 
displayed on gill ( J. McCafferty. R. H. Jackson, D. J. Chiswell. (1991) Protein 
Engineering 4, 955-961) Labelling different types of diagnostic phage with distinct 
markers would allow multiplex screening of a single DNA sample. Nevertheless, even in 
the absence of such refinements, the basic ELISA technique is reliable, fast, simple and 
particularly inexpensive. Moreover u requires no specialised apparatus, nor does it 
employ hazardous reagents such as radioactive isotopes, making it amenable to routine 
use in the clinic. The major advantage of the protocol is that it obviates the requirement 
for sel electrophoresis, and so opens the way to automated DNA diagnosis. 

The invention provides DNA binding proteins which can be engineered with exquisite 
specificity. The invention lends itself, therefore, to the design of any molecule of which 
specific DNA binding is required. For example, the proteins according to the invention 
may be employed in the manufacture of chimeric restriction enzymes, in which a nucleic 
acid cleaving domain is fused to a DNA binding domain comprising a zinc finger as 
described herein. 

The invention is described below, for the purpose of illustration only, in the following 
examples. 



Example 1 

Preparation and Screening of a Zinc Finger Phage Display Library 

A powerful method of selecting DNA binding proteins is the cloning of peptides (Smith 
(1985) Science 228, 1315-1317), or protein domains (McCafferty et al, (1990) Nature 
348:552-554; Bass et al, (1990) Proteins 8:309-314), as fusions to the minor coat protein 
(pill) of bacteriophage fd, which leads to their expression on the tip of the capsid. A phage 
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display library is created comprising variants of the middle finger from the DNA binding 
domain of Zif268. 

Materials And Methods 

5 Construction And Cloning Of Genes. In general, procedures and matenals are m 
accordance with guidance given in Sambrook et aL, Molecular Cloning, A Laboratory' 
Manual, Cold Spring Harbor, 1989. The gene for the Zif268 fingers (residues 333-420) is 
assembled from 8 overlapping synthetic oligonucleotides (see Choo and Klug, (1994) 
PNAS (USA) 91:11 163-67), giving Sfil and Notl overhangs. The genes for fingers of the 

10 phage library are synthesised from 4 oligonucleotides by directional end to end ligation 
using 3 short complementary linkers,' and amplified by PCR from the single strand using 
forward and backward primers which contain sites for Notl and Sfil respectively. Backward 
PCR primers in addition introduce Met-Ala-Glu as the first three amino acids of the zinc 
finger peptides, and these are followed by the residues of the wild type or library fingers- as 

15 required. Cloning overhangs are produced by digestion with Sfil and Notl where necessary. 
Fragments are ligated to Ijag similarly prepared Fd-Tet-SN vector. This is a derivative of 
fd-tet-DOGl (Hoogenboom et aU (1991) Nucleic Acids Res, 19, 4133-4137) in which a 
section of the pelB leader and a restriction site for the enzyme Sfil (underlined) have been 
added by site-directed mutagenesis using the oligonucleotide: 

20 

5* CTCCTGCAGTTGGACCTGTGCCAT GGCCGGCTGGGC CGCATAGAATGG 
AACAACTAAAGC 3' (Seq ID No. 1) 

which anneals in the region of the polylinker. Electrocompetent DH5a cells are 
25 transformed with recombinant vector in 200ng aliquots, grown for 1 hour in 2xTY medium 
with 1% glucose, and plated on TYE containing 15ug/ml tetracycline and 1% glucose. 

Figure 1 shows the amino acid sequences of the three zinc fingers derived from Zif268 used 
in the phage display library of the present invention. The top and bottom rows represent the 
30 sequence of the first and third fingers respectively. The middle row represents the sequence 
of the middle finger. The randomised positions in the a-helix of the middle finger have 
residues marked 'X 1 . The amino acid positions are numbered relative to the first helical 
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residue (position 1). For amino acids at positions -1 to +8. excluding the conserved Leu and 
His, codons are equal mixtures of (G,A.CjNN: T in the first base position is omitted in order 
to avoid stop codons, but this has the unfortunate effect that the codons for Trp, Phe, Tyr 
and Cys are not represented. Position -r9 is specified by the codon A(G.A)G, allowing 
5 either Arg or Lys. Residues of the hydrophobic core are circled, whereas the zinc hgands 
are written as white letters on black circles. The positions forming the p-sheets and the a- 
helix of the zinc fingers are marked below the sequence. 



Phage Selection. Colonies are transferred from plates to 200ml 2xTY/Zn/Tet (2xTY 

10 containing 50jJ,M Zn(CH3COO) 2 and 15ug/ml tetracycline) and grown overnight. Phage 
are purified from the culture supernatant by two rounds of precipitation using 0.2 volumes 
of 20% PEG/2.5M NaCl containing 50uM Zn(CH3.COO) 2 , and resuspended m zinc finger 
phage buffer (20mM HEPES pH7.5, 50mM NaCl, ImM MgCI 2 and 50jiM 
Zn(CH3.COO)2). Streptavidin-coated paramagnetic beads (Dynal) are washed in zinc 

15 finger phage buffer and blocked for 1 hour at room temperature with the same buffer made 
up to 6% in fat-free dried milk (Marvel). Selection of phage is over three rounds: in the first 
round, beads (1 mg) are saturated with biotinylated oligonucleotide (~80nM) and then 
washed prior to phage binding, but in the second and third rounds 1.7nM oligonucleotide 
and 5]ig poly dGC (Sigma) are added to the beads with the phage. Binding reactions 

20 (1.5ml) for 1 hour at 15°C are in zinc finger phage buffer made up to 2% in fat-free dried 
milk (Marvel) and 1% in Tween 20. and typically contained 5xl0 n phage. Beads are 
washed 15 times with lml of the same buffer. Phage are eluted by shaking in 0.1M 
triethylamine for 5min and neutralised with an equal volume of 1M Tris pH7.4. Log phase 
E coli TGI in 2xTY are infected with eluted phage for 30min at 37°C and plated as 

25 described above. Phage litres are determined by plating serial dilutions of the infected 
bacteria. 
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Sequencing Of Selected Phage. Single colonies of transformants obtained after three rounds 
of selection as described, are grown overnight in 2xTY/Zn/Tet. Small aiiquots of the 
cultures are stored in 15% glycerol at -20°C to be used as an archive. Single-stranded 
DNA is prepared from phage in the culture supernatant and sequenced using the 
5 Sequenase™ 2.0 kit (U.S. Biochemical Corp.). 

Example 2 

Isolation of zinc fingers capable of C-T differentiation 

10 The phage are selected against oligonucleotides comprising the sequences GCGGCGGCG 
and GCGGTGGCG. some zinc finger DNA-binding domains are selected which bound 
both sequences equally well (Fig. lb, c). However, two additional zinc finger families 
are isolated which are capable of differential binding to the two closely related sites (Fig. 
lb, c). Sequence-specific recognition requires discrimination of the central base in" the 

15 binding site by amino acids in position 3 of the recognition helix of the selected zinc 
fingers, and it is noted that aspartate is selected to bind opposite cytosine in the triplet 
GCG, while alanine is selected opposite thymine in the triplet GTG. The correlation 
between thymine and alanine is particularly significant, as it implies a van der Waals 
interaction between the amino acid side-chain and the 5-methyl group of the base. 

20 Indeed, when thymine is mutated to deoxyuracil in the binding sites of such fingers there 
is a dramatic decrease in the strength of the intermolecular interaction (Fig 1c). This 
shows that these zinc fingers are capable of specifically recognising a 5-methyl group, 
and suggests that similar fingers might be selected which bind 5-meC by the same token. 

25 Example 3 

Selection of 5-methylcytosine-specific zinc fingers 

The phage display library is screened with the synthetic binding site GCG GMG GCG, 
containing a 5-meC base analogue (M). After 5 rounds of selection, zinc finger phage are 
30 tested for binding to 5-meC and cytosine in the context of the above site, and those 
capable of specifically binding the methylated site are sequenced in the region of the zinc 
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finger gene. Two different clones are isolated, which are identical to the DNA-bmding 
domains previously selected using the binding site GCGGTGGCG. 

Hence the various zinc finger phage selections described above yield different fingers 
5 able to bind the generic DNA sequence GCGGNGGCG, where N is either thymine, 
cytosine or 5-meC. A full complement of fingers is selected for recognition of the 
cytosine/5-meC pair in the above context, some of which recognise one type of base 
exclusively, while others bound both bases equally well (Figures lc and 2). 

10 The zinc finger amino acid residues which are selected by the interaction between the 
randomised recognition helix and the central base of the DNA binding site are 
rationalised in terms of previously elucidated zinc finger-DNA recognition rules. Fingers 
with alanine in position +3 of the recognition helix specifically bind 5-meC and thymine 
owing to a tight hydrophobic interaction between the side chain and the 5-methyl group 

15 which is present in both bases. In contrast, a finger with valine in position +3 is also able 
to accommodate cytosine in addition to the two methylated bases, by the use of different 
rotamers. Fingers with aspartate in position 4-3 bind cytosine specifically, for example by 
forming a ring structure which packs against the pyrimidine as is observed in the refined 
crystal structure of Zif268. 

20 

Example 4 

Selection of 5-meC Specific Zinc Fingers using Cross-Strand Specificity 

1 . General Procedures 

25 Construction of overlapping finger phage display libraries 

Two zinc finger DNA binding domain libraries are constructed comprising the 
amino acid framework of wild-type Zif268, but containing randomisations in amino acid 
positions of fingers 2 and 3. The first library contains randomisations at F2 residue 
position 6 and F3 residue positions -1,1,2 and 3 and recognises sequences of the form 

30 5'-GXX-XCG-GCG-3\ The second library additionally contains variations in F2 position 
3 and F3 positions 5 and 6 and recognises sequences of the form 5'-XXX-XXG-GCG-3\ 
The libraries are denoted collectively as LF2/3. 
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The series for the two zinc finger phage display libraries are assembled from synthetic 
DNA oligonucleotides by directional end-to-end ligation using short complementary 
DNA linkers. The oligonucleotides contain selectively randomised codons, encoding all 

5 20 amino acids or a subset thereof, in the appropriate amino acid positions of fingers 2 
and 3. The constructs are amplified by PCR using primers containing Not I and Sfi I 
restriction sites, digested with the above endonucleases to produce cloning overhangs, 
and ligated into vector Fd-Tet-SN. Electrocompetent E. coli TGI cells are transformed 
with the recombinant vector and plated onto TYE medium (1.5% (w/v) agar, 1% (w/v) 

10 Bactotryptone, 0.5% (w/v) Bacto yeast extract, 0.8% (w/v) NaCl) containing 15 mg/ml 
tetracycline. 

Phage selections 

Tetracycline resistant colonies are transferred from plates into 2xTY medium 
15 (16g/litre Bactotryptone, lOg/litre Bacto yeast extract, 5g/litre NaCl) containing 50 t uM 
ZnCb and 15 ug/ml tetracycline, and cultured overnight at 30°C in a shaking incubator. 
Cleared culture supernatant containing phage particles is obtained by centrifuging at 300g 
for 5 minutes. 

20 DNAs of the form 5'-tatagtG-XXXX-GGCGtgtcacagtcagtccacacacgtc-3', and their 
complementary strands, are chemically synthesised and annealed in 20mM Tris-HCl, pH 
8, lOOmM NaCl. The DNA sequences -XXXX- represent nucleotide sequences after 
methylation by M.Haelll (GGMC) or MMhal (GMGC). Since DNA is chemically 
synthesised, the DNA sites used in selections incorporate 5-meC (in appropriate positions 

25 on both strands) with 100% yield. Selections are also carried out on derivatives of these 
sites containing thymine rather than 5-meC in the appropriate positions (and with A 
rather than C on the complementary strand as appropriate). 

One picomole of each target site is bound to streptavidin-coated tubes (Boehnnger 
30 Mannheim) in 50^1 PBS containing 50|aM ZnCb. Bacterial culture supernatant 
containing phage is diluted 1:10 in selection buffer (PBS containing 50^M ZnCb, 2% 
(w/v) fat-free dried milk (Marvel), 1% (v/v) Tween, 20^xg/ml sonicated salmon sperm 
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DNA). and lmJ is applied to each tube. In order to increase the selection pressure, 50 
pmol soluble (unbiotinylated) competitor sues are synthesised and added to the binding 
mixtures: selections for phage that bind the methylated DNA contain competitors with 
cytosine or thymine at the appropriate positions; selections for phage that discriminate 
5 thymine instead of 5-meC in the recognition sites of the methylase enzymes contain DNA 
competitors with cytosine or 5-meC at the appropriate positions. After 1 hour at 20°C, 
the tubes are emptied and washed 20 times with PBS containing 50|LiM ZnCb. 2% (w/v) 
fat-free dried milk (Marvel) and 1% (v/v) Tween. Retained phage are eluted in 0.1 mi 
0.1M triethylamine and neutralised with an equal volume of 1M Tris CpH 7.4). 
10 Logarithmic-phase E. coli TGI (0.5ml) are infected with eluted phage (50ml), and 
cultured overnight at 30°C in 2xTY medium containing 5Q\iM ZnCl2 and 15 ug/ml 
tetracycline, to prepare phage for subsequent rounds of selection. After 4 rounds of 
selection, E. coli TGI infected with selected phage are plated, individual colonies are 
picked and used to prepare phage for ELISA assays and DNA sequencing. 

15 

ELISA to determine nucleotide discrimination. 

Binding sites are synthesised as described above, including biotinylated sites 
where 5-meC (M) is replaced by a C or T (with appropriate bases in the complementary 
strand). Two-fold dilutions of DNA are added to separate wells of a streptavidin-coated 

20 microtitre plate (Boehringer Mannheim) in 50ul PBS containing 50)J,M ZnCb (PBS/Zn). 
Phage solution (bacterial culture supernatant diluted 1:10 in PBS/Zn containing 2% (w/v) 
fat-free dried milk (Marvel), 1% (v/v) Tween and 20|itg/mi sonicated salmon sperm 
DNA) are applied to each well (50jil/well). Binding is allowed to proceed for one hour at 
20°C. Unbound phage are removed by washing 6 times with PBS/Zn containing 1% (v/v) 

25 Tween, then 3 times with PBS/Zn. Bound phage are detected by ELISA using 
horseradish peroxidase-conjugated anti-M13 IgG (Pharmacia Biotech) and the 
colourimetric signal quantitated using SOFTMAX 2.32 (Molecular Devices). 

ELISA using an enzymatically methylated DNA binding site. 
30 Complementary DNA oligonucleotides containing the sequences methylated by 

M.Haelll and M.Hhal are chemically synthesised and annealed as described above. The 
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DNA is used in binding assays without exposure to the methylases, or after reaction with 
either or both methyiase enzymes according to the manufacturer's instructions (New 
England Biolabs). DNA binding sites (0.5 prnoi) are added to wells of a streptavidin- 
coated microtitre plate (Boehringer Mannheim) in 5Q\x\ PBS containing 50]xM ZnCb 
5 (PBS/Zn), The binding of various zinc finger phage clones is assayed by ELISA as 
described above. 

DNA sequence analysis 

The coding sequence of individual zinc finger clones is amplified by PCR using 
10 external primers complementary to phage sequence. These PCR products are then 
sequenced manually using Thermo Sequenase cycle sequencing (Amersham Life 
Science). 

2. Experimental Results 
15 Design of sequence-specific zinc finger proteins which bind enzymatically methylated 

DNA sites. 

The three-finger DNA-binding domain of transcription factor Zif268 binds the 
DNA sequence GCGTGGGCG. Phage display libraries of this zinc finger domain have 
been used to elucidate aspects of the base-recognition mechanism of zinc fingers and to 

20 select fingers which bind to predetermined DNA sequences. We have constructed a set 
of phage display libraries in which amino acid positions from both finger 2 (F2) and 
finger 3 (F3) of Zif268 are simultaneously randomised in order to evaluate the effect of 
inter-finger synergy on the specificity of DNA binding. These libraries, hereafter denoted 
collectively as LF2/3, contain variants which specifically recognised DNA sequences of 

25 the form XXXXCGGCG or GXXXCGGCG, where X is any nucleotide. 

The Haelll and Hhal methyltransferases modify the internal cytosine (shown in bold 
lettering) of their respective DNA recognition sequences GGCC and GCGC We 
therefore designed two DNA oligos, one containing the sequence GGCCCGGCG and the 
30 other GCOCCGGCG, which included the sites required for modification by the 
respective methylases M.Haelll or M.Hhal (underlined). The oligos also place these 
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sequences in the context of binding sites that could be used to screen LF2/3 tor zinc 
fingers that specifically recognise the modified DNA. 

The two different target DNA oligonucleotides are prepared using solid phase DNA 
5 synthesis such that 5-meC is be chemically incorporated into the appropriate positions 
(shown in bold lettering) with 100% yield, and a biotin group is added to the 5' terminus 
of one DNA strand. The synthetically modified DNAs are coupled to a solid support 
coated with streptavidin and used in separate phage selections as described above. After 
four rounds of selection, individual zinc finger clones from either selection are screened 

10 by phage ELISA for binding to the methylated form of their DNA target site and 
discrimination against a control oligo containing the unmodified DNA. Four different 
zinc finger phage clones with varying specificity are selected for further study: (i) clone 
zfHAE(M) preferentially binds the methylated DNA incorporating the Haelll site; (ii) 
clone zfHHA(M) preferentially binds the methylated DNA incorporating the Hhal site; 

15 (iii) clone zfHAE(Y) binds the DNA incorporating the Haelll site regardless of the 
methylation status; and (iv) clone zfHHA(Y) binds the DNA incorporating the Hhal site 
regardless of the methylation status. 

Table 1 shows the sequences of the oligonucleotides used for selection and of the 
20 resulting clones obtained. 
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Table 1 








Oligonucleotide Sequences 




5 


HAE(M) 


S'-tatagtG-GGMC-GGCGtgtcacagccagtccacacacgcc- 


3' 




HHA(M) 


S'-tatagtG-GMGC-GGCGtgccacagtcagtccacacacgtc- 


■ ^ 1 




HAE(Y) 


5'-tatagtG-GGYC-GGCGtgtcacagtcagtccacacacgtc- 


o 


10 










HHA(Y) 


S'-tatagtG-GYGC-GGCGcgtcacagtcagtccacacacgtc- 


-3' 




HAE(T) 


S'-tatagtG-GGTC-GGCGtgtcacagtcagtccacacacgtco 


15 


wherein; 


M = 5-meC 

Y = pyrimidine (C/T/M) 
R = Purine (A/G) 






Zinc Finger Clones 




20 




FI F2 F3 
-1123456 -1123456 -1123456 




25 


zfHAE(M) 
zfHHA(M) 
zfHAE { Y) 
zfHHA(Y) 
zfHAE(T) 


RSDELTR RSDDLSQ RKHHRKE 
RSDELTR RSDDLTR YDGARKR 
RSDELTR RSDDLTG KNRDRKR 
RSDELTR RSDHLSA TNSTRTK 
RSDELTR RSDDLST RNDHRKT 





30 



Zinc finger phage binding for each of the above clones is titrated against different 
amounts of methylated and unmethylated DNA oligos to derive values of the apparent 
dissociation constants (IQs) for either DNA binding site (see Figures 4 and 5). The 
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apparent IQ of each clone for the optimally bound DNA sue(s) is in the nanomolar range, 
similar to that of wild-type Zif268 DNA-binding domain for its preferred target site using 
this assay. The K,s obtained are shown in Table 2. Clones zfHAE(M) and zfHHA(M) 
preferentially bind their respective DNA target sites when 5-meC is incorporated into the 

5 correct nucleotide positions, and discriminated against the unmethylated DNA sites by- 
factors of approximately 20-fold and 5-fold respectively. The discrimination shown by 
zfHAE(M) in particular is good considering the simple DNA recognition mechanism of 
zinc fingers, and that only a single functional group per DNA molecule has been altered. 
Clones zfHAE(Y) and zfHHA(Y) bind their respective target sites but do not show any 

10 preference for either the modified or unmodified forms. 

The four zinc finger clones isolated by phage display using synthetic 5-meC -containing 
DNA target sites are next tested for binding to enzymatically methylated DNA. In- this 
assay a single DNA fragment is used that incorporates both the GGCC CGGCG and the 

15 GCGC CGGCG zinc finger binding site sequences (Figure 6a), which additionally are 
substrates for methylation by M.Haelll and M.Hhal respectively. Each zinc finger clone 
is tested for binding to the DNA before and after DNA modification using one or both 
methylases. Figure 6b shows that, in contrast to zfHAE(Y) and zfHHA(Y) which both 
recognise the DNA regardless of the methylation status (as would be expected), 

20 zfHAE(M) and zfHHA(M) bind only after specific methylation of the DNA by the 
appropriate methylase enzyme. Thus enzymatic modification of cytosine to 5-meC can 
act as a switch that induces specific protein-DNA complex formation. 
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K d s of each clone for target and non-target oligonucleotides 
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Clone 


Oligonucleotide 


K d 




zfHAE (M) 


G-GGMC-GGCG 


2.0 +/ 


- 0 . 2nM 




G-GGCC-GGCG 


62 + /- 


29nM 


zfHHA(M) 


G-GMGC-GGCG 


14 +/- 


3 . 2nH 




G-GCGC-GGCG 


62 +/- 


22nM 


zf HAE (Y) 


G-GGMC-GGCG 


6.3 +/ 


- 1 . 4nM 




G-GGCC-GGCG 


2.0+/- 


0 . 2nM 


zfHHA(Y) 


G-GMGC-GGCG 


14 + /- 


2 . OnM 




G-GCGC-GGCG 


11 +/- 


2 . 4nM 



15 

Synergistic zinc finger pairs that discriminate 5-methylcytosine from thymine. 

The 5 -methyl group of methylcytosine and thymine is a prominent feature of the 
DNA major groove which contributes important intermolecular (hydrophobic) contacts in 
protein-DNA interactions but is stereochemical^ indistinguishable in the two different 

20 bases. Consequently, zinc fingers - which frequently achieve DNA recognition by 1:1 
contacts between amino acids and bases - often fail to discriminate between the two 
closely related bases. The phage-selected clone zfHHA(M) is one such zinc finger 
protein which accepts both thymine and 5-meC with almost equal affinity (Figure 5). In 
this case it is likely that the aromatic ring of tyrosine forms equally good hydrophobic 

25 contacts with the methyl group of either base. 

One way in which zinc finger proteins could distinguish 5-meC from thymine is to 
discriminate the complementary nucleotide in the base-pair. Zinc finger proteins such as 
Zif268 make base contacts predominantly to only one DNA strand - the 'antiparallel' 
30 strand - but, importantly, they can also form 'cross-strand' contacts to certain bases on the 
complementary, 'parallel' strand. It has been shown that these contacts can make 
important contributions to DNA-binding specificity. Thus the zinc fingers of Zif268 and 
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related proteins can be regarded as binding to overlapping 4bp subsites, where the 
specificity for the base-pair at the boundary between adjacent subsites potentially anses 
via contacts from two synergistic zinc fingers to each of the nucleotides in the base-pair 
(Figure 3). Therefore a zinc finger protein can distinguish a 5-meC:G base-pair from a 
5 T:A base-pair provided they are positioned at the overlap between adjacent DNA 
subsites, such that a contact to the 'parallel* strand can be made. 

This is the case for the DNA binding site GGMCCGGCG in which the 5-meC base (bold) 
is discriminated from thymine by zinc finger clone zfHAE(M). According to the 

10 conventional model of zinc fmger-DNA recognition, based on the crystal structure of the 
Zif268-DNA complex and subsequent biochemical experiments, the 5-meC base in the 
binding site is contacted by the glutamine residue in a-helical position +6 of finger 2 
(Figure 3). Additionally, the complementary guanine can be recognised using a 
synergistic contact from the histidine residue in cc-helical position +2 of finger 3 (Figure 

15 3). 

In order to investigate further the discrimination between 5-meC and thymine, another 
zinc finger clone is selected, zfHAE(T), which is specific for thymine instead of 5-meC in 
the context of the above binding site. This clone makes use of a cross-strand contact 

20 from aspartate in position +2 of finger 3 to recognise adenine in the parallel' strand. In 
this respect zfHAE(T) is remarkably like the wild-type Zif268 DNA-binding domain, 
whose zinc fingers each have an Arg-Ser^Asp triad that makes inter- and intra- molecular 
contacts including cross-strand contacts from the aspartate. Discrimination in favour of 
thymine by zfHAE(T) is relatively stronger than discrimination for 5-meC by zfHAE(M), 

25 presumably owing to the stabilising effect of intramolecular (protein-protein buttressing) 
interactions and the favourable geometry of this network of contacts. 

The dissociation constants for the interactions seen between zfHAE(M), zfHHA(M) and 
zfHAE(T) and 5-meC or T oligonucleotides are set forth in Table 3. 

30 
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Table 3 

K d s of each clone for 5-meC and T oligonucleotides 
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Clone 


Oligonucleotide 


K a 




zfHAE (M) 


G-GGMC-GGCG 


2.0+/ 


- 0 .2nM 




G-GGTC-GGCG 


27 +/- 


4 . 4aM 


zfHKA(M) 


G-GMGC-GGCG 


14 tI- 


3 . 2nM 




G-GTGC-GGCG 


6.1 +/ 


- 4.5nM 


zfHAE(T) 


G-GGMC-GGCG 


3.4 +/ 


- O.SnM 




G-GGTC-GGCG 


n/a 





15 Example 5 

Methylcytosine-specific restriction enzyme 

Phage-selected or rationally designed zinc finger domains which recognise modified 
bases, including 5-meC, can be converted to restriction enzymes which cleave DNA 
20 containing those modified bases, including 5-meC This is achieved by coupling a 
modified base-specific zinc finger to a cleavage domain of a restriction enzyme or other 
nucleic acid cleaving moiety, 

A method of converting zinc finger DNA-binding domains to chimaeric restriction 
25 endonucleases has been described in Kim, ex al, (1996) Proc, NatL Acad. Sci. USA 
93:1156-1160. in order to demonstrate the applicability of methylcytosine-specific zinc 
fingers to restriction enzymes, a fusion is made between the catalytic domain of Fok I as 
described by Kim et al and the 5-meC specific zinc finger described in Example 3. 
Fusions of the 5-meC zinc finger nucleic acid-binding domain to the catalytic domain of 
30 Fok I restriction enzyme results in a novel endonuclease which cleaves DNA adjacent to 
the DNA recognition sequence of the zinc finger, namely GCGGMGGCG. 
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The oligonucleotides GCGGMGGCG and GCGGCGGCG are synthesised and ligated to 
random DNA sequences. After incubation with the zinc finger restriction enzyme, the 
nucleic acids are analysed by gel electrophoresis. Bands indicating cleavage of the 
nucleic acid at a position corresponding to the location of the oligonucleotide 
5 GCGGMGGCG are visible with the methylated, but not the unmethylated, nucleic acid. 

In a further experiment, the 5-meC-specific zinc finger is fused to an amino terminal 
copper/nickel binding motif. Under the correct redox conditions (Nagaoka, M., ei ai, 
(1994) J. Am. Chem. Soc. 116:4085-4086), sequence-specific DNA cleavage is 
10 observed, only in the presence of 5-meC containing DNA incorporating the 
oligonucleotide GCGGMGGCG. 

Example 6 

Determination of methylase activity in vivo 

15 

A reporter systems is produced which produces a reporter signal conditionally depending 
on the activity of a DNA methylase. 

A transient transfection system using zinc finger transcription factors is produced as 
20 described in Choo, Y., et aU (1997) J. Mol. Biol 273:525-532. This system comprises 
an expression plasmid which produces a 5-meC specific phage-selected zinc finger fused 
to the activation domain of HSV VP 16, and a reporter plasmid which contain the 
recognition sequence of the zinc finger upstream of a CAT reporter gene. 

25 Thus, a zinc finger which recognises the DNA sequence GCGGCCGCG selected by 
phage display as described in Choo, Y. & Klug, A. (1994) Proc. Natl. Acad. Sci. 
U.S.A. 91:11 163-1 1 167. By the method of the preceding examples, a further zinc finger 
is selected which is capable of binding to the sequence GCGGMCGCG where the central 
base M is 5-meC, and used to construct transcription factors as described in the 

30 foregoing- 
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A transient expression experiment is conducted, wherein the CAT reporter gene on the 
reporter plasmid is placed downstream of the sequence GCGGCCGCG. The reporter 
plasmid is cotransfected with a plasmid vector expressing the zinc fmger-HSV fusion 
under the control of a constitutive promoter. No activation of CAT gene expression is 
observed. 

However, when the same experiment is conducted in the presence of Hae III methylase, 
CAT expression is observed as a result of the methylation of GCGGCCGCG to form 
GCGGMCGCG, and consequent binding of the zinc finger transcription factor to its 
recognition sequence. 
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1. A zinc finger polypeptide which binds to a target DNA sequence containing a 
modified base but not to an identical sequence containing the equivalent unmodified base. 

5 

2. A polypeptide according to claim L wherein the target DNA sequence comprises 
a triplet having 5-meC at the central position, and binding to the 5-meC residue by an a- 
helical zinc finger binding motif in the polypeptide is achieved by placing an Ala residue 
at position +3 of the a-helix. 

10 

3. A method for preparing a DNA binding polypeptide of the Cys2-His2 zinc finger 
class capable of binding to a DNA triplet in target DNA sequence comprising 5-meC as 
the central residue in the target DNA triplet, wherein binding to the 5-meC residue by an 
a-helical zinc finger DNA binding motif of the polypeptide is achieved by placing an Ala 

15 residue at position +3 of the a-helix of the zinc finger. 

4. A method for preparing a DNA binding polypeptide of the Cys2-His2 zinc finger 
class capable of binding to a DNA triplet in target DNA sequence comprising 5-meC, but 
not to an identical triplet comprising unmethylated C, wherein binding to each base of the 

20 triplet by an a-helical zinc finger DNA binding motif in the polypeptide is determined as 
follows: 

a) if the 5 J base in the triplet is G, then position +6 in the a-helix is Arg and/or position 
++2 is Asp; 

25 b) if the 5' base in the triplet is A, then position +6 in the a-helix is Gin or Glu and ++2 
is not Asp; 

c) if the 5' base in the triplet is T, then position +6 in the a-helix is Ser or Thr and 
position ++2 is Asp; or position +6 is a hydrophobic amino acid other than Ala; 

d) if the 5* base in the triplet is C, then position +6 in the a-helix may be any amino acid, 
30 provided that position ++2 in the a-helix is not Asp; 

e) if the central base in the triplet is G, then position +3 in the a-helix is His; 

f) if the central base in the triplet is A, then position +3 in the a-helix is Asn; 
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g) if the central base in the triplet is T, then position +3 in the a-heiix is Ala. Ser, He, 
Leu, Thr or Val; provided that if it is Ala, then one of the residues at -1 or +6 is a small 
residue; 

h) if the central base in the triplet is 5-meC, then position +3 in the a-helix is Ala, Ser, 
5 lie, Lea, Thr or Val; provided that if it is Ala, then one of the residues at -1 or +6 is a 

small residue; 

i) if the 3' base in the triplet is G, then position -1 in the a-helix is Arg; 

j) if the 3' base in the triplet is A, then position -1 in the a-helix is Gin and position +2 is 
Ala; 

10 k) if the 3' base in the triplet is T, then position -1 in the a-helix is Asn; or position -1 is 
Gin and position +2 is Ser; 

1) if the 3' base in the triplet is C, then position -1 in the a-helix is Asp and Position +1 
is Arg. 

15 5. A method for producing a zinc finger polypeptide capable of binding to a DNA 
sequence comprising a modified residue, but not to an identical sequence comprising an 
equivalent unmodified residue, comprising the steps of: 

a) providing a DNA library encoding a repertoire of zinc finger polypeptides, the 
20 DNA members of the library being at least partially randomised at one or more of the 

positions encoding residues -1, 2, 3 and 6 of an a-helical zinc finger binding motif of the 
zinc finger polypeptides; 

b) displaying the library in a selection system and screening it against a target DNA 
25 sequence comprising the modified residue; 

c) isolating the DNA members of the library encoding zinc finger polypeptides 
capable of binding to the target sequence; and 

30 d) optionally, verifying that the zinc finger polypeptides do not bind significantly to 
a DNA sequence identical to the target DNA sequence but containing the equivalent 
unmodified residue in place of the modified residue. 
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6. A method according to claim 5, wherein the nucleic acid library' encodes a 
repertoire of zinc finger polypeptides each possessing more than one zinc fingers, the 
nucleic acid members of the library being at least partially randomised at one or more of 

5 the positions encoding residues -1, 2, 3 and 6 of the ohelix m a zinc finger and at one or 
more of the positions encoding residues -1, 2, 3 and 6 of the a-helix in a further zinc 
finger of the zinc finger polypeptides. 

7. A method according to claim 5 or claim 6, wherein the modified residue is 5-meC 
10 and the unmodified residue is C. 

8. A method according to claim 5 or claim 6, wherein the modified residue is U and 
the unmodified residue is T. 

15 9. A method according to any one of claims 5 to 8, wherein the library is screened by 
phage display. 

10. A method according to any one of claims 5 to 9, wherein the or each zinc finger 
has the general primary structure 

20 

(A) X a C X 2 . 4 C X 2 _ 3 FX c XXXXLXXHXXX b H- linker 

-1 123456789 

wherein X (including X a , X b and X c ) is any amino acid. 

25 

11. A method according to claim 10 wherein X a is F /y-X or P- f /y-X. 

12. A method according to claim 10 or claim 1 1 wherein X2.4 is selected from any one 
of: S-X, E-X, K-X, T-X, P-X and R-X. 

30 

13. A method according to any one of claims 10 to 12 wherein X b is T or I. 
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14. A method according to any one of claims 10 to 13 wherein X:-} is G-K-A, G-K-C, 
G-K-S, G-K-G, M-R-N or M-R. 



15. A method according to any one of claims 10 to 14 wherein the linker is T-G-E-K 
or T-G-E-K-P. 

16. A method according to any one of claims 10 to 15 wherein position +9 is R or K. 

17. A method according to any one of claims 10 to 16 wherein positions +1, +5 and 
+8 are not occupied by any one of the hydrophobic amino acids, F, W or Y. 

18. A method according to claim 17 wherein positions +1, +5 and +8 are occupied by 
the residues K, T and Q respectively. 

19. A method for preparing a DNA binding polypeptide of the Cys2-His2 zinc finger 
class capable of binding to a DNA triplet in target DNA sequence comprising 5-meC, but 
not to an identical triplet comprising unmethylated C: 

a) selecting a model zinc finger domain from the group consisting of naturally 
occurring zinc fingers and consensus zinc fingers; and 

b) mutating the finger by the method of any one of claims 3 to 17. 

20. A method according to claim 19, wherein the model zinc finger is a consensus 
zinc finger whose structure is selected from the group consisting of the consensus 
structure PYKCPECGKSFSQKSDLVKHQRTHTG, and the consensus 
structure PYKCSECGKAFSQKSNLTRHQRIHTGEKP. 

21. A method according to claim 19 wherein the model zinc finger is a naturally 
occurring zinc finger whose structure is selected from one finger of a protein selected 
from the group consisting of Zif 268 (Eirod-Erickson et al* (1996) Structure 4:1171- 
1180), GLI (Pavletich and Pabo, (1993) Science 261:1701-1707), Tramtrack (Fairall et 
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aU (1993) Nature 366:483-487) and YY1 (Houbaviy ec aL (1996) PNAS (USA) 
93:13577-13582). 

22. A method according to claim 21 wherein the model zinc finger is finger 2 of Zif 
5 268. 

23. A method according to any one of claims 3 to 22 wherein the binding protein 
comprises two or more zinc finger binding motifs, placed N-terminus to C-terminus. 

10 24. A method according to claim 22, wherein the N-terminal zinc finger is preceded 
by a leader peptide having the sequence MAEEKP. 

25. A method according to ciaim 23 or claim 24, wherein the DNA binding protein is 
constructed by recombinant DNA technology, the method comprising the steps of: 

15 

a) preparing a DNA coding sequence encoding two or more zinc finger binding 
preparable according to ciaim 23 or 24, placed N-tenninus to C-terminus; 

b) inserting the DNA sequence into a suitable expression vector; and 

c) expressing the DNA sequence in a host organism in order to obtain the DNA binding 
20 protein, 

26. A method according to one of claims 3 to 25 comprising the additional steps of 
subjecting the DNA binding protein to one or more rounds of randomisation and selection 
in order to improve the characteristics thereof. 

25 

27. A zinc finger polypeptide which binds to a target DNA sequence containing a 
modified base but not to an identical sequence containing the equivalent unmodified base, 
preparable by a method according to any one of claims 3 to 26. 
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(i) G C G G M G G C G 

-1 123456789 
R A D® L M V H K R 
R G D@L T S H E R 

(ii) G C G G T G G C G 

-1 123456789 
R A D®L M V H K R 
R G D@L T S H E R 
R V D®L E A H R R 
R E D(A)L I R H G K 



(Hi) G C G G C G G C G 

-1 1 23456789 
R G P@L A R H G R 
R E D@L I R H G K 

FIG. 1B 
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F2 



6 3 2-16 3 
zfHae(M) EKRHHKR QSLD 

5-tatagtG G G M C GGCGt gtcacagtcagtccacacgtc-3 1 



atatcaC C M G G CCGCacag tgtcagtcaggtgtgcag 



F3 



F2 



zf Hha (M) 



6 32-16 3 
RKRAGDY RTLD 



G G Y C 



GGCGt gtcacagtcagtccacacgtc- 3 



5-tatagtG 

atatcaC C M R G CCGCacag t g t cag t cagg t g t gcag 



F3 



F2 



zf Hae (Y) 



6 32 -1 
RKRDRNH 



6 3 
GTLD 



5-tatagtG 
atatcaC 



uz/ 



GGCGt gtcacagtcagtccacacgtc- 3 



G G Y C 

C M R|G CCGCacag t g t cag t c agg t g t gcag 



zf Hha (Y) 



F3 

6 32 -1 
KTRTSNT 



F2 
6 3 
AS L H 



t at agt G 
atatcaC 



1 US 



G Y G C 



GGCGt gtcacagtcagtccacacgtc-3 



CRM G CCGCacag t g t cag t caggt gt gcag 



F3 F2 

6 32-16 3 
zfHae(T) TKRHDNR TSLD 



s'-tatagtG G G.J ClGGCGt gt cacagt cagt ccacacgt c- 3 1 



atatcaC C M [A] G CCGCacag t g t cag t cagg t g t gcag 
M"5-methylcytosme Y=pyrimidine (C/T/M) R=purine (A/G) 

FIG. 3 
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FROM.s^, D. VOUNG ,&„C0 , ( wii , , m „ t Ffl X NO.: + 44 0 23 89719890 1 1-93-01 16 = 93 P. 96 

FOR UtlMTY/OESlGN RULE 63 {37 C.F-R. 1 .63) PM & S 

C|P/PCT NATIONAL/PLANT DECLARATION AND POWER OF ATTORNEY FORM 

ORIGINAiiSUBSTlTUT^UPPLEMENTAU FOR PATENT APPLICATION 

DECLARATIONS IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

As a below named inventor, l Heresy declare thai my residence, post office address ana citizenship are as stated below next to my name, and I 
&*j>*ve ! am me original, first and sole Kwentor of only one name is listed oelow) or an original, fast aM joint inventor (if plwraf names are Usted 

Detow) of Ifte subject matter wftich * Claimed and for wh»cn a patent is eoagnt on tne iNVENTlON EMTm^D NUCLEIC ACID BtNPfNG 

PROTEINS 

Tfte gpecificairfui of wh*cn rOfECfC apohcaple BOXcES) 1 ' ~ 

X A □ ** artacnea hereto 

BOX(ES) 6. 0 was filed on September 18, 2000 as U S. Application No 08/646,353 



* C □ was filed as PCT international Application No PCT/ v on 

ana f rf apnleabta toUSof pCT application! was amended On 



i rtereby state mat t r»avg renewed ana unoanHartf Itm c&iifime of *bo»e iqenttfed spacificirtiOn. lauding the claims, as arnenqeq by any am^ndnwi refanfca TO 
aoove. I ec^nowiBoge pne ouiy to a&cio&a all infomv^win xrio^n to me to pe material to palBnttOtmy as defined in 37 C f R i .56 £u»pi a& noted wow i neraoy claim 
foreign priority penefScs under u.S C 1 1 Stand) or 365(Bj of £ny raretqn appliCiBiOn(&j toi pajefii or inventors certificate, or 365(a) of any PCT international 
Application wntcn dssionaieo ai one otner country man me umtsd Stares area Data* and ha^e aiao loenufieo beio* any foretfin application tor parent or tnventsrs 
cerUfcaie. or PCT inicm&uonal Appucauon. fi^ad ay me or my assignee fltfCtoiotfl the subject msmer claimed in in* application and nawng a filing dart p) odore mac o* 
me application on wrvcn prwrrty ^ asrfneo. or (2> if no priority cuwneo, before trie filino. a ate or Wi& apptcaboft 

PBtPR fQPPg^ App^-fCATlOHff | _ P«ntfirffM»f<»- Pftfr P^rmH| 

Number Coynja Day/HOfflft/Yftaf Flfe^ open or Publish** or Grants Priority NOT Claimed 

9805576.7 Greai Britain 1 77M&rCrV1993 

9806835 0 Gne« Briton 3i/Marcnn 998 

9807246.5 GrBarBmam D3jAprWim 



tf mare p^of foreign aooiicaripn. X ao* ar QOflom * no eoiifcnua on nacwaa aaoa 

e^sepi as rwtao kiow, i naracy oam ConwM pnorny oenafit und«r S5 u S" C 1l9{aj or 120 ana/or seSjc) of rtvp m4<cawfl uniied States ap^mbon's iiStea dciqw arrt 
PCT tnternationdi appljamonft bated apove or eeiow and. tt tna a conunuaiion-tn-pan (CiP ) eppucabon insorar as me euDjact matcnr dieciasea ana ctaimeo in ir^s 
appitcauan i& in 94d*oon io vtai dts0o4BO «n sjjcti pruv Aopacaaoiis, f acknowteoge tne duty to d^cttoe an intonnaDan known to me to oe materia (o pxxenm&uTy 
defined m 37 C F R l .$6 Mfnic^ became avatfstfti fcaeiween &ing date ct eacA 6ucn pnor appucation and ma national or PCT iniarnaxtonai fji^g date oi gua 
appiicaoon 



PRIOR Ci.S. P«OVISIOIHAV T NONPROVISIOKAL AND/OR PCT APPUCATIOWf S\ 
Appiieatjon No. fserj^ cadetsefiai nol Oav/MQNJH/Yaar f Had 



1 narc&y oeoara v^i a» »taKmanw rrwae r>erwi of m r OT»n Krwwteag* w« uve ans wai at) «3ram«ni* rnaoe on mtormaiion and txtutt are oenev^Q io t» uub. anq 
fudrwr mat trase Etaiernenu were mane wnn we knowledge o»i wilful Me swernents ana \m ftQ mage bjb pun»»r\aH« py ftne or imprisonment, or win unaef 
5m4K4i i ooi of Tina i A of me wmied Stales Cooe and thni »acn wiifwl tatee afAiemeno may ^eopanf^u tne vanacy of m« 9ppj*oib&n Qr any pajem is«w&o trsreon. 

Arm i rwriEey eppomi Pot^ry Mao«qn & s^tro luP. lnw«c=u*£u Property Group. -noO New Vwk AvtnuB. N.w Nrnm Floor. East W wa*ft,r>9ton. O C 20005^9 ie 
tfiifipnone numtwr (202) 86i -30oa tto ^nom ail conimunica&ons are to be orcoeaj. and no oejow-n*rn«j paraona (of m« aoarete) uvoivauaJiy ana concaivety my 
anorn&y* m p^nsecuie cru& spphcaun and to iran&aa a»i Dubrutui m tne Paie^c and Tracmmarx once connac&Ba tftei twtin ena win mo resulting pautnt. ana t rweny 
Butnonzb in^m io dei^ia nam«amum&e<v oetow ot persons no tonser vntn irwr finn and to act and rety on mstrue&ons rrom and convnunu^tfi a/eaty wm ine 
parson/«««9n«a/aaomay/?Srnv ersamzation wno/wfrcft fail ftends/&ont ttws caae 10 mem and Py wnQm/wnten I Hereby doctam mat i nave concerned afWr Ml apjewawe 
to pe fepi^emed unia«&Vyn(ri I tnarud me aDove Firrn anq/ora befo*r anofney m wnnng to ma comcary 



Paul N KohuliS 16773 

^4 aeymono F uppiti 175J9 
J^CG. Ltoyd Knignt 17fi98^ 
H Kevin E Joyce 205P8- 
~ v Oeorge M S.mia V8^JL 
Donew j. Bird 
Peter w Gowdey 
Pate S. Lazar 



Paul E wmtp, Jr 
Gienn J Perry 
KendrewH Cotton 
G Paul Edgetf 
kynn E Ecdeeton 
TimomyJ Klimd 
Q^fvioA Jafcopi 
i G. Paufscfn 



3 2071 Siepneo c £ta2ier 

Rutn N. Morducn 
303ga^ Ricnard h 2aiilen 
it23fflL_ Roger R. W« 
358S3_ jay M Finketotem 
34852 _ Michael R ttowonttyk 
329S5„ w. pstncfc BenQtsson 



31361 
310^ 
27248 

3T2PT 



A4amR Ncas 
William P Aifcirw 
PaufL Planer 



^1835 ., 
ifiOOa. 




3§m 

32456 

37f^ 



united Kingdom , 



FamhNama _ 
I united IfCingdjDm 



Residence 



Post Office Address 



c/o MRC LaDorauwy of Molecular Blolooy, Medical Research Coundl. MiHa Road. Cambridge CB2-2QM Unrtsd 
Kingdom 



T 



[ jinc^e Zip 



Cooe) 



U) INVENTOR'S SIGNATURE : 



J Maifc" 



Pa»: 



/ Fan^Name 
I United Ktngdq 



ftffit 



f United Kin o dom ^-^Pyjf 



United Kingdom 



Oouroy of CBgerohy 



Pea Office Adorass 



24 Snotrfieid Avanua East Sneen, London SW14 8EA, United KingdOT 



(include Zip Codq) 



for ADPmoNAL inventors. u X H box □ and proceed on me anacnea page 10 lisi eacn aaaiconai inventor 

□ See addrtional foreign priorities on attached page (incorporated herein by reference) 

Atty Dkt No. PM273884 
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Rule 56{a) & (b) = 37 C.F.R. 1.56(a) & (b) 
PATENT AND TRADEMARK CASES - RULES OF PRACTICE 
DUTY OF DISCLOSURE 

(a) ... Each individual associated with the filing and prosecution of a patent application has a duty of candor and 
good faith in dealing with the [Patent and Trademark] Office, which includes a duty to disclose to the Office alt 
information known to that individual to be material to patentability,. .(b) information is material to patentability 
when it is not cumulative and (1) It also establishes by itself, or in combination with other information, a prima 
fade case of unpatentability of a claim or (2) refutes, or is inconsistent with, a position the applicant takes in: (i) 
Opposing an argument of unpatentability relied on by the Office, or (ii) Asserting an argument of patentability 

PATENT LAWS 35 U.S.C. 

§102. Conditions for patentability; novelty and loss of right to patent 

A person shall be entitled to a patent unless- 

(a) the invention was known or used by others in this country, or patented or described in a printed publication in this 
or a foreign country, before the invention thereof by the applicant for patent or 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of the application for patent in the United States, or 

(c) he has abandoned the invention, or 

(d) the invention was first patented or caused to be patented, or was the subject of an inventor's certificate, by the 
applicant or his legal representatives or assigns in a foreign country prior to the date of the application for patent 
in this country on an application for patent or inventor's certificate filed more than twelve months* before the filing 
of the application in the United States, or 

(e) the invention was described in a patent granted on an application for patent by another filed in the United States 
before the invention thereof by the applicant for patent, or on an international application by another who has 
fulfilled the requirements of paragraphs (1), (2), and (4) of section 371(c) of this title before the "invention thereof 
by the applicant for patent, or 

(f) he did not himself invent the subject matter sought to be patented, or 

(g) before the applicant's invention thereof the invention was made in this country by another who had not 
abandoned, suppressed, or concealed it In determining priority of invention there shall be considered not only 
the respective dates of conception and reduction to practice of the invention, but also the reasonable diligence of 
one who was first to conceive and last to reduce to practice, from a time prior to conception by the other. 



§103. Condition for patentability; non-obvious subject matter 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, rf the differences between the subject matter sought to be patented and the prior 
art are such that the subject matter as a whole would have been obvious at the time the invention was 
made to a person having ordinary skill in the art to which said subject matter pertains, Patentability shall 
not be negatived by the manner in which the invention was made 

(c) Subject matter developed by another person, which qualified as prior art only under subsection (f) or (g) of 
section 102 of this title, shall not preclude patentability under this section where the subject matter and the 
claimed invention were, at the time the invention was made, owned by the same person or subject to an 
obligation of assignment to the same person. 



* Six momhs for Design Applications (35 U.S.C 1 72). 



